Group: 1646
CFR 1.53(b), please file a [X] continuation-in-part
of the pending prior PATENT APPLICATION of: 
Inventor: YOKOI et al 
Serial No. 08/765,337
Filed: October 6, 2000 

For: hG-CSF FUSION POLYPEPTIDE HAVING c-mpl ACTIVITY, DNA CODING FOR SAME
AND METHODS OF TREATING ANEMIA USING SAME (AS AMENDED)

Assistant Commissioner for Patents 
Washington, DC 20231 
Sir: 

This request for filing under Rule 53(b) is made by the following named inventor(s) (using the above-identified title): 
Inventor(s): YOKOI et al

[X] Attached is a true copy of the prior application as originally filed including the specification, claims, Oath/Declaration
and drawings (if any) and abstract (if any). No amendments (if any) referenced in the Oath or Declaration filed to 
complete the prior application introduced new matter. 

Priority is hereby claimed under 35 USC 119 based on the following foreign applications, the entire content of which is 
hereby incorporated by reference in this application: 

Application Number    Country    Day/Month/Year Filed

Hei 7-102625          Japan      26 April 1995

PCT/JP96/01157                   26 April 1996

□ certified copy(ies) of foreign application(s) attached or 

□ already filed on _____ in prior appln. no. filed

[X] already filed in PCT/JP96/01157 filed 26 April 1996



[X] The prior application is assigned to Kyowa Hakko Kogyo Co., Ltd.
[X] Power of Attorney has been granted to Arthur R. Crawford et al, Reg. No. 25,327 of Nixon & Vanderhye P.C., 1100 N.
Glebe Rd., 8th Flr, Arlington, VA 22201.

[X] Address all future communications to: Nixon & Vanderhye P.C., 1100 N. Glebe Rd., 8th Floor, Arlington, VA 22201.
Please amend the specification by inserting before the first line --This is a continuation-in-part of application Serial No.
08/765,337, filed December 23, 1996, now pending, the entire content of which is hereby incorporated by
reference in this application.--

□ "Small entity" statement of record. □ "Small entity" statement attached.

□ Petition filed in prior application to extend its life to insure copendency.
[X] The Examiner's attention is directed to the prior art cited in the parent application by applicant and/or Examiner for the
reasons stated therein.

[X] Please enter the attached and/or below preliminary amendment prior to calculation of filing fee:

Attached Preliminary Amendment, Information Disclosure Statement, Declaration re Deposited Materials, 
Letter re transfer of sequence listing from parent case and copy of sequence listing. 

[X] The entire disclosure of the prior application above-referenced is considered as being part of the disclosure of this
new application and is hereby incorporated by reference therein. 

FILING FEE IS BASED ON CLAIMS AS FILED LESS ANY HEREWITH CANCELED

Basic Filing Fee                                                        $ 710.00
Total effective claims   9 - 20 (at least 20) = 0 x $ 18.00
Independent claims       2 -  3 (at least 3)  = 0 x $ 80.00
If any proper multiple dependent claims now added for first time, add $270.00 (ignore improper)
SUBTOTAL                                                                $ 710.00
If "small entity," then enter half (1/2) of subtotal and subtract
SECOND SUBTOTAL                                                         $ 710.00
Assignment Recording Fee ($40.00)                                       $   0.00
TOTAL FEE ENCLOSED                                                      $ 710.00
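
The arithmetic on the fee sheet above can be sketched as follows. This is only an illustration of the "claims minus threshold (at least threshold)" rule printed on the form, using the rates shown there ($710.00 basic fee, $18.00 per excess claim, $80.00 per excess independent claim). The function name `filing_fee` and its parameters are hypothetical, and the multiple-dependent-claim surcharge is deliberately left out.

```python
from decimal import Decimal


def filing_fee(total_claims, independent_claims,
               basic_fee=Decimal("710.00"),
               per_extra_claim=Decimal("18.00"),
               per_extra_independent=Decimal("80.00"),
               small_entity=False,
               assignment_fee=Decimal("0.00")):
    """Hypothetical helper mirroring the fee sheet (rates taken from the form)."""
    # "9 - 20 (at least 20) = 0": the claim count is raised to the threshold
    # before subtracting, so the excess can never be negative.
    extra_total = max(total_claims, 20) - 20
    extra_independent = max(independent_claims, 3) - 3

    subtotal = (basic_fee
                + extra_total * per_extra_claim
                + extra_independent * per_extra_independent)

    # "If small entity, then enter half (1/2) of subtotal and subtract."
    second_subtotal = subtotal / 2 if small_entity else subtotal

    return second_subtotal + assignment_fee


if __name__ == "__main__":
    print(filing_fee(total_claims=9, independent_claims=2))
```

With 9 total claims and 2 independent claims there are no excess-claim charges, so the total equals the basic filing fee, which agrees with the amounts entered on the sheet.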

Any future submission requiring an extension of time is hereby stated to include a petition for such time extension. 
The Commissioner is hereby authorized to charge any deficiency in the fee(s) filed, or asserted to be filed, or which 
should have been filed herewith (or with any paper hereafter filed in this application by this firm) to our Account No. 14-1140.
A duplicate copy of this sheet is attached.
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE



In re CIP of Patent Application Serial No. 08/765,337 of 

YOKOI et al                         Atty. Ref.: 249-118

Serial No. to be assigned Group: 1646 

Filed: October 6, 2000 Examiner: Mertz 

For: hG-CSF FUSION POLYPEPTIDE HAVING c- 
mpl ACTIVITY, DNA CODING FOR SAME 
AND METHODS OF TREATING ANEMIA 
USING SAME(As Amended) 

October 6, 2000 

Assistant Commissioner for Patents 
Washington, DC 20231 

Sir: 

PRELIMINARY AMENDMENT 

In order to place the above-identified application in better condition for 
examination, please amend the application as follows: 

IN THE TITLE 

Amend the title to read as follows: --hG-CSF FUSION POLYPEPTIDE
HAVING c-mpl ACTIVITY, DNA CODING FOR SAME AND METHODS OF 
TREATING ANEMIA USING SAME-- 

IN THE SPECIFICATION 

Page 11, line 24, delete "No. 4, 5 or 6" and insert --NOS: 5, 7 or 9--.
YOKOI et al

Serial No. to be assigned 

Page 12, line 26, delete "No. 4, 5, or 6" and insert --NOS: 4, 6 or 8--.
Page 13, line 7, delete "No. 4, 5 or 6" and insert --NOS: 4, 6 or 8--; and
line 13, delete "No. 4, 5 or 6" and insert --NOS: 4, 6 or 8--.

IN THE CLAIMS 

Cancel claims 1-9 and add the following new claims: 
--10. A fusion polypeptide which comprises a human granulocyte colony 
stimulating factor polypeptide and c-mpl ligand polypeptide encoded by a nucleotide 
sequence selected from the nucleotide sequence SEQ ID NOS: 4, 6 and 8 wherein the 
fusion polypeptide has no mouse IL-3 activity. 

11. The fusion polypeptide according to claim 10 in which the human granulocyte
colony stimulating factor polypeptide is fused via a spacer peptide to a c-mpl ligand 
polypeptide. 

12. A fusion polypeptide encoded by a nucleotide sequence selected from the 
nucleotide sequences shown in SEQ ID NOS: 4, 6 and 8. 

13. The fusion polypeptide of claim 10 chemically modified with a polyalkylene 
glycol derivative. 

14. The fusion polypeptide according to claim 13 wherein the polyalkylene glycol 
derivative is a polyethylene glycol derivative, a polypropylene glycol derivative or a 
polyoxyethylene-polyoxypropylene copolymer derivative. 

15. A pharmaceutical composition for treating anemia comprising the fusion 
polypeptide of claim 10 in a pharmaceutically acceptable carrier, vehicle or auxiliary 
agent. 

16. A method of treating anemia comprising administering to a subject in need of 
same an effective amount of the fusion polypeptide of claim 10. 

17. A method of simultaneously amplifying platelets and neutrophils comprising 
administering to a subject in need of same an effective amount of the fusion polypeptide 
of claim 10. 

18. A method of controlling formation of megakaryocyte colonies and neutrophil 
colonies and/or controlling differentiation or maturation of megakaryocyte precursors and 
neutrophil precursors comprising administering to a subject in need of same an effective 
amount of the fusion polypeptide of claim 10.--



REMARKS



This application is a continuation-in-part of application Serial No. 08/765,337 and 
is identical to the disclosure of that application with the exception of a single sentence 
added to page 5 of the specification, first full paragraph. This statement indicates that the 
fusion protein of the present invention has no mouse IL-3 activity. This is an inherent 
property of the fusion protein and is illustrated in the examples at pages 50 and 51. 
Applicants retain their priority rights to both the underlying PCT and Japanese priority 
applications in accordance with In re Davies et al, 177 USPQ 381 (CCPA 1973).

The claims now before the examiner are the same claims that were presented in 
the parent application with the amendment filed July 27, 2000. The title of this
application has been amended, and passages at pages 11-13 of the specification
have been adjusted.

With this filing there is a comprehensive Information Disclosure Statement 
bringing forward the art of record in the parent application. 

It is submitted that claims 10-18 are in condition for allowance. Favorable action 
is solicited. 



Respectfully submitted,

NIXON & VANDERHYE P.C.

Reg. No. 25,327

ARC:pfc
1100 North Glebe Road, 8th Floor
Arlington, VA 22201-4714
Telephone: (703) 816-4000
Facsimile: (703) 816-4100
SPECIFICATION
NOVEL POLYPEPTIDES 

TECHNICAL FIELD 

The present invention relates to a fusion polypeptide 
comprising a polypeptide having a granulocyte colony 
stimulating factor (hereinafter referred to as "G-CSF") 
activity and a polypeptide having a platelet growth factor
(thrombopoietin, hereinafter referred to as "TPO") activity,
and DNA which codes for the fusion polypeptide. Since the
fusion polypeptide of the present invention can form and
amplify platelets and neutrophils simultaneously, it is useful
for the treatment of anemia and the like.

BACKGROUND ART 

Blood comprises hematopoietic cells such as
erythrocytes, leukocytes, platelets and the like. These
hematopoietic cells mature from only one kind of pluripotential
blood stem cell through various differentiation steps. These
steps undergo complex regulation by a group of proteinaceous
factors which are generally referred to as cytokines. A
certain type of cytokine takes part in the differentiation and
multiplication of various hematopoietic cells. On the other
hand, a certain type of hematopoietic cell undergoes regulation
of its differentiation and multiplication by various types of
cytokines. This is called overlapping cytokine actions. Among
these cytokine members, TPO and G-CSF are considered to have
small overlapping actions.

TPO mainly takes part in the formation of platelets.
Platelets are formed by the fragmentation of megakaryocytes, a
hematopoietic cell which has a large nucleus and is present
mainly in bone marrow. Platelets are essential for forming
blood clots at damaged portions in blood vessels. Platelets
also play important roles in not only blood coagulation but
also injury healing by releasing proteins having other
functions at the damaged portions. A significant decrease in
the number of platelets may be fatal, because the body may
easily bleed.

G-CSF is a cytokine which accelerates activation of
neutrophils, a member of the leukocytes, and differentiation of
neutrophils from their precursor cells. Neutrophils exert the
first defense action when the body is invaded by foreign
organisms such as bacteria, viruses and the like. When the
number of neutrophils is decreased, the body becomes defenseless
against infection, and this too is often fatal.

Current medical treatment of cancers often causes side
effects in which pluripotential blood stem cells are damaged by
the administration of a chemotherapeutic drug, irradiation with
X-rays or bone marrow transplantation for the treatment of
leukemia, thus decreasing the number of all hematopoietic
cells. Evidently, it is markedly beneficial for thrombopenia
and leukopenia patients to amplify the number of these cells by
the administration of a cytokine, so as to suppress bleeding
tendency and prevent infectious diseases.

A cytokine which can amplify platelets and neutrophils 
simultaneously has not been found, and there is no medicine 
having such an effect. 

Leukemia inhibiting factors, stem cell factors,
macrophage colony stimulating factors, granulocyte/macrophage
colony stimulating factors, erythropoietin, interleukin (IL)-3,
IL-6, IL-11, megakaryocyte colony stimulating factors and the
like are known as substances which amplify platelets or enhance
differentiation and multiplication of megakaryocytes [Metcalf
et al., Blood, 76, 50-56 (1990); Hunt et al., Blood, 80,
904-911 (1992); Examined Japanese Patent Publication No. 6-11705;
Hoffman et al., Blood Cells, 13, 75-86 (1987); Mazur et al.,
Exp. Hematol., 15, 1123-1133 (1987); McNiece et al., Exp.
Hematol., 16, 807-810 (1988); Lu et al., Brit. J. Hematol., 70,
149-156 (1988); Ishibashi et al., Proc. Natl. Acad. Sci. USA,
86, 5953-5957 (1989); WO 95/21919; WO 95/18858]. It is
understood that these many cytokine members amplify platelets
by overlapping actions. Recently, it was revealed that the
ligand of a receptor called c-mpl is a cytokine which has the
highest activity among platelet amplifying factors and acts
directly [de Sauvage et al., Nature, 369, 533 (1994)].

As substances which multiply granulocytes, the
above-mentioned IL-3, macrophage colony stimulating factors,
granulocyte/macrophage colony stimulating factors and the like
are known, but G-CSF has the highest activity in terms of
multiplying neutrophils selectively [Nicola et al., J. Biol.
Chem., 258, 9017 (1983)]. With regard to a polypeptide in
which two different kinds of cytokine are fused, there are
reports in Japanese Published Unexamined International Patent
Application No. 500116/94, U.S. Patent 5,359,035, Exp.
Hematol., 21, 647-655 (1993) and ibid., 18, 615 (1990) and the
like.

However, nothing is known about a fusion polypeptide in
which TPO is used as one of the fused cytokines.

An object of the present invention is to provide a 
fusion polypeptide which can produce and amplify platelets and 
neutrophils simultaneously. With this fusion polypeptide, the
formation of megakaryocyte colonies and neutrophil colonies and
the differentiation or maturation of megakaryocyte precursors
and neutrophil precursors can be controlled.

DISCLOSURE OF THE INVENTION 

The present invention relates to a fusion polypeptide 
which comprises a polypeptide having G-CSF activity and a 
polypeptide having TPO activity and DNA which codes for the 
fusion polypeptide. Also disclosed are fusion polypeptides in 
which a polypeptide having G-CSF activity and a polypeptide 
having TPO activity are fused via a spacer peptide and DNA 
which codes for the fusion polypeptide; and a polypeptide in 
which the fusion polypeptide comprising a polypeptide having
G-CSF activity and a polypeptide having TPO activity is chemically modified with
a polyethylene glycol derivative. Also provided are anemia-treating 
compositions containing the fusion polypeptide as an active ingredient. 

The fusion protein of the present invention has no mouse IL-3 activity. 

As the polypeptide having G-CSF activity for use in the present
invention, any protein may be used with the proviso that it has the requisite
G-CSF activity, such as a polypeptide having the amino acid sequence shown in
Table 1 [Nature, 319, 415 (1986)].

Also useful is a protein which has an amino acid sequence derived from 
the amino acid sequence shown in Table 1 by substitution, deletion or addition 
of one or more amino acids, and examples thereof include
hG-CSF derivatives shown in Table 2 and described in Japanese Published
Unexamined Patent Application No. 267299/88, Japanese Published
Unexamined Patent Application No. 299/88, and Japanese Published
Unexamined International Patent Application No. 500636/88.



TABLE 1 



X ThrProLeuGlyProAlaSerSerLeuProGlnSerPheLeuLeu
  1           5              10             15

  LysCysLeuGluGlnValArgLysIleGlnGlyAspGlyAlaAlaLeu
              20             25             30

  GlnGluLysLeuCysAlaThrTyrLysLeuCysHisProGluGluLeu
           35             40             45

  ValLeuLeuGlyHisSerLeuGlyIleProTrpAlaProLeuSerSer
        50             55             60

  CysProSerGlnAlaLeuGlnLeuAlaGlyCysLeuSerGlnLeuHis
     65             70             75

  SerGlyLeuPheLeuTyrGlnGlyLeuLeuGlnAlaLeuGluGlyIle
  80             85             90             95

  SerProGluLeuGlyProThrLeuAspThrLeuGlnLeuAspValAla
              100            105            110

  AspPheAlaThrThrIleTrpGlnGlnMetGluGluLeuGlyMetAla
           115            120            125

  ProAlaLeuGlnProThrGlnGlyAlaMetProAlaPheAlaSerAla
        130            135            140

  PheGlnArgArgAlaGlyGlyValLeuValAlaSerHisLeuGlnSer
     145            150            155

  PheLeuGluValSerTyrArgValLeuArgHisLeuAlaGlnPro
  160            165            170         174

(X represents H or Met.)
As the polypeptide having TPO activity for use in the
present invention, any protein may be used with the proviso
that it has the requisite TPO activity, such as the c-mpl
ligand which is a polypeptide having the amino acid sequence
shown in Table 3 [Nature, 369, 533 (1994)], as well as leukemia
inhibiting factors, stem cell factors, macrophage colony 
stimulating factors, granulocyte/macrophage colony stimulating 
factors, erythropoietin, interleukin (IL)-3, IL-6, IL-11, 
megakaryocyte colony stimulating factors and the like. 
TABLE 3

SerProAlaProProAlaCysAspLeuArgValLeuSerLysLeu
1           5              10             15

LeuArgAspSerHisValLeuHisSerArgLeuSerGlnCysPro
            20             25             30

GluValHisProLeuProThrProValLeuLeuProAlaValAsp
            35             40             45

PheSerLeuGlyGluTrpLysThrGlnMetGluGluThrLysAla
            50             55             60

GlnAspIleLeuGlyAlaValThrLeuLeuLeuGluGlyValMet
            65             70             75

AlaAlaArgGlyGlnLeuGlyProThrCysLeuSerSerLeuLeu
            80             85             90

GlyGlnLeuSerGlyGlnValArgLeuLeuLeuGlyAlaLeuGln
            95             100            105

SerLeuLeuGlyThrGlnLeuProProGlnGlyArgThrThrAla
            110            115            120

HisLysAspProAsnAlaIlePheLeuSerPheGlnHisLeuLeu
            125            130            135

ArgGlyLysValArgPheLeuMetLeuValGlyGlySerThrLeu
            140            145            150

CysValArgArgAlaProProThrThrAlaValProSerArgThr
            155            160            165

SerLeuValLeuThrLeuAsnGluLeuProAsnArgThrSerGly
            170            175            180

LeuLeuGluThrAsnPheThrAlaSerAlaArgThrThrGlySer
            185            190            195

GlyLeuLeuLysTrpGlnGlnGlyPheArgAlaLysIleProGly
            200            205            210

LeuLeuAsnGlnThrSerArgSerLeuAspGlnIleProGlyTyr
            215            220            225

LeuAsnArgIleHisGluLeuLeuAsnGlyThrArgGlyLeuPhe
            230            235            240

ProGlyProSerArgArgThrLeuGlyAlaProAspIleSerSer
            245            250            255

GlyThrSerAspThrGlySerLeuProProAsnLeuGlnProGly
            260            265            270

TyrSerProSerProThrHisProProThrGlyGlnTyrThrLeu
            275            280            285

PheProLeuProProThrLeuProThrProValValGlnLeuHis
            290            295            300

ProLeuLeuProAspProSerAlaProThrProThrProThrSer
            305            310            315

ProLeuLeuAsnThrSerTyrThrHisSerGlnAsnLeuSerGln
            320            325            330

GluGly
   332
The polypeptide having G-CSF activity and the other
polypeptide having TPO activity, which constitute the fused
polypeptide of the present invention, are not particularly
limited, provided that they contain their respective
activity-producing portions. For example, when the c-mpl ligand is used
as the polypeptide having TPO activity, it may contain an amino 
acid sequence of the 153rd and 154th positions counting from 
the N-terminal amino acid. 

Also included in the polypeptide of the present 
invention is a polypeptide in which a polypeptide having G-CSF 
activity and a polypeptide having TPO activity are fused via a 
spacer peptide. As the spacer peptide, any sequence may be 
used with the proviso that it does not spoil the G-CSF activity
and TPO activity. For example, the peptide shown in Table 4 
can be used as the spacer peptide. 

TABLE 4 

Linker 

(GlyGlyGlySer)3 Arg
(SerGlyGlyGly)4 Arg
SerGlyGlyGlyArg 
(SerGlyGlyGly)4
SerGlyGlyGly
(GlyGlyGlySer)3
(GlyGlyGlySer)2
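
The repeat notation used in Table 4 can be expanded mechanically. The following is a minimal sketch, assuming that a trailing number after a parenthesised unit means "repeat this unit that many times" and that the residues are written as three-letter codes run together. The names `expand_linker` and `THREE_TO_ONE` are hypothetical and cover only the three residues that occur in the table.

```python
import re

# Three-letter to one-letter codes for the residues that appear in Table 4.
THREE_TO_ONE = {"Gly": "G", "Ser": "S", "Arg": "R"}


def split_residues(block):
    """Split a run such as 'GlyGlySer' into ['Gly', 'Gly', 'Ser']."""
    if len(block) % 3 != 0:
        raise ValueError(f"not a run of three-letter codes: {block!r}")
    return [block[i:i + 3] for i in range(0, len(block), 3)]


def expand_linker(notation):
    """Expand e.g. '(GlyGlyGlySer)3 Arg' into a one-letter sequence."""
    text = notation.replace(" ", "")
    residues = []
    # Each match is either a parenthesised unit with a repeat count,
    # or a bare run of three-letter codes.
    for unit, count, bare in re.findall(r"\(([A-Za-z]+)\)(\d+)|([A-Za-z]+)", text):
        if unit:
            residues.extend(split_residues(unit) * int(count))
        else:
            residues.extend(split_residues(bare))
    return "".join(THREE_TO_ONE[r] for r in residues)


if __name__ == "__main__":
    for entry in ["(GlyGlyGlySer)3 Arg", "SerGlyGlyGlyArg", "(GlyGlyGlySer)2"]:
        print(entry, "->", expand_linker(entry))
```

Writing the linkers out in full this way makes their lengths easy to compare when choosing a spacer.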
Examples of the fusion polypeptide of the present
invention include a polypeptide having the amino acid sequence
shown in Sequence ID No. 1, 2 or 3 and a polypeptide derived
from the amino acid sequence of the fusion polypeptide by
addition, deletion or substitution of one or more amino acids 
within such a range that the G-CSF activity and TPO activity 
are not spoiled, having a homology of 40% or more with the 
amino acid sequence of the polypeptide. The homology is 
preferably 60% or more, and more preferably 80% or more. 
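
The text does not state how homology is to be calculated. As one possible reading, the sketch below computes percent identity over two sequences that have already been aligned (gaps written as "-") and reports which of the stated thresholds (40%, 60%, 80%) the value meets. Counting gap columns as mismatches, and the names `percent_identity` and `homology_tier`, are assumptions made for illustration only.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over a pre-computed pairwise alignment.

    Assumption: columns where both sequences have a gap are ignored,
    and a gap opposite a residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def homology_tier(pct):
    """Map a percentage onto the ranges named in the text."""
    if pct >= 80:
        return "more preferred (80% or more)"
    if pct >= 60:
        return "preferred (60% or more)"
    if pct >= 40:
        return "included (40% or more)"
    return "below 40%"


if __name__ == "__main__":
    pct = percent_identity("SPAPP", "SPA-P")
    print(pct, homology_tier(pct))
```

Other conventions (for example, similarity scoring with a substitution matrix, or ignoring gap columns entirely) would give different figures for the same pair of sequences.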

The substitution, deletion or addition of amino acids
can be carried out in accordance with known methods described
for example in Nucleic Acids Research, 10, 6487 (1982); Proc.
Natl. Acad. Sci., USA, 79, 6409 (1982); Proc. Natl. Acad. Sci.,
USA, 81, 5662 (1984); Science, 224, 1431 (1984); PCT WO
85/00817; Nature, 316, 601 (1985); Gene, 34, 315 (1985);
Nucleic Acids Research, 13, 4431 (1985); and "Current Protocols
in Molecular Biology", Chap. 8, Mutagenesis of Cloned DNA, John
Wiley & Sons, Inc. (1989).

Also included in the fusion polypeptide of the present 
invention is a peptide having an amino acid sequence in which 
a secretion signal peptide is added to the N-terminal amino 
acid of the above-mentioned polypeptide; examples include a 
polypeptide having the amino acid sequence shown in Sequence ID 
No. 4, 5 or 6.

In addition, a fusion polypeptide having G-CSF activity 
and TPO activity, in which at least one amino group of the 
above-mentioned polypeptide is chemically modified with a
polyalkylene glycol derivative, is also included in the fusion
polypeptide of the present invention.

Examples of the polyalkylene glycol derivative include a
polyethylene glycol derivative, a polypropylene glycol
derivative, a polyoxyethylene-polyoxypropylene copolymer
derivative and the like. Polyethylene glycol-succinimidyl
propionate is preferred.

The fusion polypeptide chemically modified with a
polyethylene glycol derivative can be prepared in accordance
with the method described in Japanese Examined Patent
Publication No. 96558/95.

The DNA which codes for the fusion polypeptide
(hereinafter referred to as "TPO-CSF") of the present invention
can be obtained by polymerase chain reaction (PCR) and the like
based on the known nucleotide sequences of a polypeptide having
TPO activity and a polypeptide having G-CSF activity. It can
also be obtained by chemical synthesis.

Examples of DNA which codes for TPO-CSF include a DNA
containing a nucleotide sequence that codes for a polypeptide
having the amino acid sequence shown in Sequence ID No. 1, 2 or
3 or a polypeptide derived from the amino acid sequence of the
polypeptide by substitution, deletion or addition of one or
more amino acids but having the G-CSF activity and TPO
activity, such as a DNA which contains the nucleotide sequence
shown in Sequence ID No. 4, 5 or 6.
Other examples are DNAs in which mutation such as
substitution mutation, deletion mutation, insertion mutation or
the like is introduced into the above-mentioned DNA within such
a range that the G-CSF activity and TPO activity are not
spoiled, which can be obtained, for example, by colony
hybridization or plaque hybridization using a DNA containing
the nucleotide sequence shown in Sequence ID No. 4, 5 or 6 as
a probe.

An example is a DNA which is identified by carrying out
hybridization of a membrane filter on which colony- or
plaque-originated DNA is fixed, at 65°C in the presence of 0.7 to 1.0
M sodium chloride using a DNA containing the nucleotide
sequence shown in Sequence ID No. 4, 5 or 6 as a probe, and
subsequently washing the resulting filter at 65°C in 0.1 to
2-fold SSC solution (1-fold SSC contains 150 mM sodium chloride
and 15 mM sodium citrate).
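
The salt concentrations implied by an "n-fold SSC" wash follow directly from the 1-fold composition given above (150 mM sodium chloride, 15 mM sodium citrate). The sketch below simply scales those two figures; the names `ssc_composition` and `within_wash_range` are hypothetical.

```python
# Composition of 1-fold SSC as stated in the text, in mM.
SSC_1X_MM = {"sodium chloride": 150.0, "sodium citrate": 15.0}


def ssc_composition(fold):
    """Return the salt concentrations (mM) of an n-fold SSC solution."""
    if fold <= 0:
        raise ValueError("fold must be positive")
    return {salt: mm * fold for salt, mm in SSC_1X_MM.items()}


def within_wash_range(fold, low=0.1, high=2.0):
    """True if the SSC strength lies in the 0.1- to 2-fold wash range."""
    return low <= fold <= high


if __name__ == "__main__":
    for fold in (0.5, 1.0, 2.0):
        print(fold, ssc_composition(fold), within_wash_range(fold))
```

Lower SSC strength at the same temperature corresponds to a more stringent wash, so the 0.1-fold end of the range retains only the more closely matching hybrids.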

The hybridization techniques are described in
"Molecular Cloning, A Laboratory Manual", second edition
(edited by Sambrook, Fritsch and Maniatis, Cold Spring Harbor
Laboratory Press, 1989).

All polypeptides encoded by the DNA defined in the 
foregoing are included in the TPO-CSF.

Examples of plasmids containing the TPO-CSF-encoding
DNA include pBS-T153LND28, pBS-T154ND28 and pBS-T153ND28LN1.
Escherichia coli TLN-1, a strain containing
pBS-T153LND28, and Escherichia coli TN-1, a strain
containing pBS-T154ND28, were deposited on February 16,
1995, with the National Institute of Bioscience and Human-Technology,
Agency of Industrial Science and Technology, Higashi 1-1-3,
Tsukuba-shi, Ibaraki, Japan (postal code: 305), and have
been assigned the designations FERM BP-5001 and FERM BP-5002,
respectively.

In order to express the thus obtained TPO-CSF-encoding
gene (hereinafter referred to as "TPO-CSF gene") in a host, a
DNA fragment containing the TPO-CSF gene is first cleaved into
a TPO-CSF gene-containing DNA of an appropriate length with
restriction enzymes or DNA hydrolyzing enzymes and inserted
into a site downstream of a promoter on an expression vector,
and then the resulting expression vector is introduced
into a host suitable for the expression vector.

As the host, any host capable of expressing the 
intended gene can be used. Examples thereof include microbial 
strains belonging to the genera Escherichia, Serratia, 
Corynebacterium, Brevibacterium, Pseudomonas, Bacillus and the
like, as well as yeast strains, animal cell hosts and the like. 

Useful as the expression vector is a vector which can 
replicate by itself in the above-mentioned host or can be 
inserted into its chromosome and has a promoter at a site where 
transcription of the TPO-CSF gene can be made. 

When a microorganism such as Escherichia coli or the 
like is used as the host, it is desirable that the TPO-CSF 
expression vector can replicate by itself in the microorganism 
and comprises a promoter, a ribosome binding sequence, the
TPO-CSF gene and a transcription termination sequence. It may also
contain a regulatory gene. 

Examples of the expression vector include pBTrp2,
pBTac1 and pBTac2 (all available from Boehringer-Mannheim Co.),
pKYP10 (Japanese Published Unexamined Patent Application No.
110600/83), pKYP200 [Agric. Biol. Chem., 48, 669 (1984)], pLSA1
[Agric. Biol. Chem., 53, 277 (1989)], pGEL1 [Proc. Natl. Acad.
Sci., USA, 82, 4306 (1985)], pBluescript (available from
STRATAGENE Co.), pTrs30 [prepared from Escherichia coli
JM109/pTrs30 (FERM BP-5407)], pTrs32 [prepared from Escherichia
coli JM109/pTrs32 (FERM BP-5408)], pAGE107 [Japanese Published
Unexamined Patent Application No. 22979/91; Miyaji et al.,
Cytotechnology, 3, 133 (1990)], pAS3-3 (Japanese Published
Unexamined Patent Application No. 227075/90), pAMoERC3Sc and
CDM8 [Brian Seed et al., Nature, 329, 840 (1987)].

As the promoter, any one capable of exerting expression 
in a host such as Escherichia coli or the like can be used. 
Examples thereof include promoters originated from Escherichia 
coli, phages and the like, such as trp promoter (Ptrp), lac
promoter (Plac), PL promoter, PR promoter and the like. Also
useful are artificially designed and modified promoters such as
a promoter prepared by connecting two Ptrp promoters in series
(Ptrp x 2), tac promoter and the like.

As the ribosome binding sequence, any sequence capable 
of exerting expression in a host such as Escherichia coli or 
the like can be used, but it is desirable to use a plasmid in
which the ribosome binding sequence and the initiation codon
are arranged with an appropriate distance (for example, 6 to 18
bases).
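
The spacing condition can be checked on a candidate construct with a few lines of code. The sketch below is illustrative only: it assumes the ribosome binding sequence is the Shine-Dalgarno-like motif AGGAGG, that the initiation codon is the first ATG downstream of it, and that the distance is counted as the number of bases lying between the two. None of these choices is specified in the text, and the function names are hypothetical.

```python
def rbs_to_start_spacing(dna, rbs="AGGAGG", start="ATG"):
    """Bases between the end of the first RBS motif and the next start codon.

    Returns None if either element is not found. The motif and the way the
    distance is counted are assumptions for this illustration.
    """
    dna = dna.upper()
    rbs_pos = dna.find(rbs)
    if rbs_pos < 0:
        return None
    rbs_end = rbs_pos + len(rbs)
    start_pos = dna.find(start, rbs_end)
    if start_pos < 0:
        return None
    return start_pos - rbs_end


def spacing_ok(spacing, low=6, high=18):
    """True if the spacing falls in the 6 to 18 base range given as an example."""
    return spacing is not None and low <= spacing <= high


if __name__ == "__main__":
    construct = "TTAAGGAGGTTTAACTTATGGCT"  # made-up sequence for demonstration
    spacing = rbs_to_start_spacing(construct)
    print(spacing, spacing_ok(spacing))
```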

Any gene which codes for TPO-CSF can be used as the 
TPO-CSF gene, but it is desirable to use the gene by 
substituting its bases in such a manner that the DNA sequence 
of the gene has codons most suitable for its expression in host 
microorganisms.
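
Substituting bases so that each amino acid is encoded by a host-preferred codon amounts to back-translating the protein with a per-residue codon table. The sketch below shows the idea on the first ten residues of Table 3. The table `PREFERRED_CODON` is an illustrative placeholder covering only those residues, not measured codon-usage data for any host, and the function names are hypothetical.

```python
# Illustrative placeholder: one chosen codon per amino acid (one-letter code).
# A real table would be built from codon-usage data for the intended host.
PREFERRED_CODON = {
    "S": "AGC", "P": "CCG", "A": "GCG", "C": "TGC",
    "D": "GAT", "L": "CTG", "R": "CGT",
}

# Reverse lookup; it can decode only the codons chosen above.
CODON_TO_AA = {codon: aa for aa, codon in PREFERRED_CODON.items()}


def back_translate(protein):
    """Encode a protein using the single preferred codon for each residue."""
    return "".join(PREFERRED_CODON[aa] for aa in protein)


def translate(dna):
    """Decode a sequence written with the preferred codons (sanity check)."""
    if len(dna) % 3 != 0:
        raise ValueError("length must be a multiple of 3")
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    n_terminus = "SPAPPACDLR"  # residues 1-10 of Table 3, one-letter code
    dna = back_translate(n_terminus)
    print(dna)
    print(translate(dna) == n_terminus)
```

Because the genetic code is degenerate, the encoded amino acid sequence is unchanged by this substitution; only the nucleotide sequence differs.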

Although the transcription termination sequence is not 
always necessary for the expression of the gene, it is 
desirable to arrange the transcription termination sequence 
preferably just downstream of the structural gene.

Examples of the host include Escherichia coli XL1-Blue,
Escherichia coli XL2-Blue, Escherichia coli DH1, Escherichia
coli DH5α, Escherichia coli MC1000, Escherichia coli KY3276,
Escherichia coli W1485, Escherichia coli JM109, Escherichia
coli HB101, Escherichia coli No. 49, Escherichia coli W3110,
Escherichia coli NY49, Bacillus subtilis, Bacillus
amyloliquefaciens, Brevibacterium immariophilum ATCC 14068,
Brevibacterium saccharolyticum ATCC 14066, Brevibacterium
flavum ATCC 14067, Brevibacterium lactofermentum ATCC 13869,
Corynebacterium glutamicum ATCC 13032, Corynebacterium
acetoacidophilum ATCC 13870, Microbacterium ammoniaphilum ATCC
15354 and the like.
When a yeast strain is used as the host, YEp13 (ATCC
37115), YEp24 (ATCC 37051), YCp50 (ATCC 37419) or the like may
be used as the expression vector.

Any type of promoter can be used, provided that it can
exert expression in yeast strain hosts. Examples thereof
include promoters of genes of glycolytic pathway enzymes such
as hexose kinase, gal 1 promoter, gal 10 promoter,
heat shock protein promoter, MFα1 promoter, CUP 1 promoter and
the like.

Examples of the host include Saccharomyces cerevisiae,
Schizosaccharomyces pombe, Kluyveromyces lactis, Trichosporon
pullulans, Schwanniomyces alluvius and the like.

When animal cells are used as the host, examples of
useful expression vectors include pcDNA I/Amp, pcDNA I, pcDM8
(all available from Funakoshi Co., Ltd.), pcDNA 3 (available 
from Invitrogen Co.), pAGE248, pAGE210 and the like. 

Any promoter capable of exerting expression in the 
animal cell hosts can be used. For example, the promoter of 
human CMV IE (immediate early) gene may be used. Also, the 
enhancer of human CMV IE gene may be used together with the 
promoter.

Any gene which codes for TPO-CSF can be used as the 
TPO-CSF gene.

In general, only a portion of TPO-CSF expressed from
the gene is secreted extracellularly, so that, in
order to effect positive extracellular secretion of TPO-CSF
from the host, it is desirable to prepare and use a gene having
a sequence in which a nucleotide sequence coding for a signal
peptide is added to the gene, in accordance with the method of
Paulson et al. [C. Paulson et al., J. Biol. Chem., 264, 17619
(1989)] and the method of Lowe et al. [John B. Lowe et al.,
Proc. Natl. Acad. Sci., USA, 86, 8227 (1989); John B. Lowe et
al., Genes Develop., 4, 1288 (1990)].

As the host, Namalwa cells, HBT5637 (Japanese Published
Unexamined Patent Application No. 299/88), COS cells, CHO cells 
and the like may be used. 

Introduction of TPO-CSF gene-containing DNA into animal 
cells can be effected by any method, provided that it can 
introduce DNA into animal cells. For example, an
electroporation method [Miyaji et al., Cytotechnology, 3, 133
(1990)], a calcium phosphate method (Japanese Published
Unexamined Patent Application No. 227075/90), a lipofection
method [Philip L. Felgner et al., Proc. Natl. Acad. Sci., USA,
84, 7413 (1987)] and the like may be used. Isolation and
cultivation of a transformant can be effected in accordance 
with the method described in Japanese Published Unexamined 
Patent Application No. 227075/90 or Japanese Published 
Unexamined Patent Application No. 257891/90. 

TPO-CSF can be produced by cultivating the thus
obtained transformant in accordance with a commonly used
cultivation method.
When a transformant obtained by using Escherichia coli,
yeast or the like microorganism as the host is cultivated, the
medium may be either a natural medium or a synthetic medium,
with the proviso that it contains carbon sources, nitrogen
sources, inorganic salts and the like which can be assimilated
by the microorganism and that cultivation of the transformant
can be carried out efficiently.

As the carbon sources, those which can be assimilated 
by respective microorganisms are used, which include 
carbohydrates such as glucose, fructose, sucrose, molasses 
containing them, starch, starch hydrolyzates and the like, 
organic acids such as acetic acid, propionic acid and the like 
and alcohols such as ethanol, propanol and the like. 

Examples of useful nitrogen sources include ammonia, 
ammonium salts of various inorganic and organic acids, such as 
ammonium chloride, ammonium sulfate, ammonium acetate, ammonium 
phosphate and the like, and other nitrogen-containing 
compounds, as well as peptone, meat extract, yeast extract, 
corn steep liquor, casein hydrolyzate, soybean cake and soybean 
cake hydrolyzate, various fermented microbial cells and digests 
thereof. 

Examples of useful inorganic materials include 
potassium dihydrogenphosphate, dipotassium hydrogenphosphate, 
magnesium phosphate, magnesium sulfate, sodium chloride, 
ferrous sulfate, manganese sulfate, copper sulfate, calcium 
carbonate and the like. 

Cultivation is carried out under aerobic conditions by 
shaking, submerged aeration with stirring or the like. The 
temperature for the cultivation is preferably 15 to 40°C, and 
the period for the cultivation is generally 16 to 96 hours. 
The medium pH is controlled at 3.0 to 9.0 during the 
cultivation. Adjustment of the pH is carried out using an 
inorganic or organic acid, an alkaline solution, urea, calcium 
carbonate, ammonia and the like. 

As occasion demands, antibiotics such as ampicillin, 
tetracycline and the like may be added to the medium during the 
cultivation. 

When a microorganism transformed with an expression 
vector prepared using an inducible promoter is cultivated, an 
inducer may be added to the medium as occasion demands. For 
example, isopropyl-β-D-thiogalactopyranoside (IPTG) or the like 
may be added to the medium when a microorganism transformed 
with an expression vector prepared using the lac promoter is 
cultivated, or indoleacrylic acid (IAA) or the like when a 
microorganism transformed with an expression vector prepared 
using the trp promoter is cultivated. 

When a transformant obtained using animal cells as the 
host is cultivated, generally used RPMI 1640 medium, MEM medium 
(manufactured by Eagle Co. or GibcoBRL Co.), D-MEM medium 
(manufactured by GibcoBRL Co.) or any one of these media 
further supplemented with fetal bovine serum and the like may 
be used. 

The cultivation is carried out, for example, in the 
presence of 5% CO2. The temperature for the cultivation is 
preferably 35 to 37°C, and the period for the cultivation is 
generally 3 to 7 days. 

As occasion demands, antibiotics such as kanamycin, 
penicillin and the like may be added to the medium during the 
cultivation. 

Productivity can be increased using a gene 
amplification system in which the dihydrofolate reductase gene 
and the like are used, in accordance with the method described in 
Japanese Published Unexamined Patent Application No. 227075/90. 

The TPO-CSF of the present invention obtained in this 
manner can be purified by commonly used protein purification 
techniques. 

For example, when the TPO-CSF is not secreted outside 
the host cells, a culture broth of the 
transformant is subjected to centrifugation to collect cells in 
the culture broth, and the thus collected cells are washed and 
then disrupted using a sonicator, French press, Manton Gaulin 
homogenizer, Dyno-Mill or the like, thereby obtaining a cell-free 
extract. Thereafter, the cell-free extract is subjected to 
centrifugation, and the TPO-CSF is purified from the resulting 
supernatant fluid making use of various techniques including 
salting out with ammonium sulfate or a like salt, anion 
exchange chromatography on diethylaminoethyl (DEAE)-Sepharose 
or the like, hydrophobic chromatography on Butyl-Sepharose, 
Phenyl-Sepharose or the like, molecular sieve-aided gel 
filtration and various types of electrophoresis such as 
isoelectric focusing and the like. 

When the TPO-CSF is secreted, purified TPO-CSF can be 
obtained from a culture filtrate of the transformant in the 
same manner as in the above-mentioned treatment of the 
cell-free extract supernatant. 

When produced in Escherichia coli cells, it can be 
purified efficiently by the combination of the above-mentioned 
method with the method described in Japanese Published 
Unexamined Patent Application No. 267292/88. 

Also, it is possible to produce the TPO-CSF of the 
present invention in the form of its fusion protein with 
another protein and to purify the product by affinity 
chromatography using a substance having affinity for the fused 
protein. For example, it is possible to produce the TPO-CSF of 
the present invention as its fusion protein with protein A and 
purify it by an immunoglobulin G-aided affinity chromatography, 
in accordance with the method of Lowe et al. [John B. Lowe et 
al., Proc. Natl. Acad. Sci., USA, 86, 8227 (1989); John B. 
Lowe et al., Genes Develop., 4, 1288 (1990)]. 

In addition, it can also be purified by affinity 
chromatography using antibodies specific for a polypeptide 
which has G-CSF activity, such as antibodies specific for 
G-CSF. 

The TPO-CSF of the present invention can be used as it 
is or as pharmaceutical compositions in various dosage forms. 

The pharmaceutical compositions of the present 
invention are produced by mixing an effective amount of TPO-CSF 
as the active ingredient uniformly with pharmacologically 
acceptable carriers. 

Preferably, these pharmaceutical compositions may be 
prepared in the form of unit dose packages suitable for 
injection. 

Injections can be 
prepared by using a carrier such as distilled water, a salt 
solution of sodium chloride or of a mixture of sodium chloride 
with other inorganic salts, a sugar solution of mannitol, 
lactose, dextran, glucose or the like, an amino acid solution 
of glycine, arginine or the like, an organic acid solution, an 
organic base solution or a mixed solution comprising a salt 
solution and a sugar solution. In that case, the composition 
can be made into solutions, suspensions or dispersions in the 
usual way using auxiliaries which include an osmotic pressure 
adjusting agent, a plant oil such as sesame oil or soybean oil 
and a surface active agent such as lecithin or a nonionic 
surface active agent. These solutions can be made into solid 
preparations by pulverization, freeze drying or like 
means, which are dissolved again prior to their use. 

The above-mentioned pharmaceutical compositions which 
contain the TPO-CSF of the present invention as the active 
ingredient are useful for the treatment of anemia or of patients 
who become anemic as a result of the treatment of diseases. 

BRIEF EXPLANATION OF THE DRAWINGS 

Fig. 1 is an illustration showing construction of a 
plasmid containing DNA which codes for TPO-ND28 (1). 

Fig. 2 is an illustration showing construction of a 
plasmid containing DNA which codes for TPO-ND28 (2). 

Fig. 3 is an illustration showing construction of a 
plasmid containing DNA which codes for TPO-ND28 (3). 

BEST MODE OF CARRYING OUT THE INVENTION 

Example 1 Preparation of DNA which codes for TPO-CSF 

A DNA which codes for TPO-CSF was prepared in the 
following manner. As the DNA which codes for a polypeptide 
having G-CSF activity, use was made of a DNA which codes for a 
polypeptide ND28 in which the 1st position amino acid residue 
of the amino acid sequence of human G-CSF was substituted by 
alanine (Ala), the 3rd position amino acid by threonine (Thr), 
the 4th position amino acid by tyrosine (Tyr), the 5th position 
amino acid by arginine (Arg) and the 17th position amino acid 
by serine (Ser) (Japanese Published Unexamined Patent 
Application No. 267292/88). As the DNA which codes for a 
polypeptide having TPO activity, use was made of a DNA that 
codes for a polypeptide having the amino acid sequence of 
Table 3 (de Sauvage et al., Nature, 369, 533 (1994); 
hereinafter referred to as "TPO"). The fusion polypeptide of 
TPO and ND28 is abbreviated as TPO-ND28 hereinafter. 
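
The five substitutions that distinguish ND28 from human G-CSF can be written as a small position table. The following Python sketch is for illustration only: the helper name apply_substitutions and the placeholder sequence of 174 unknown residues are assumptions made here, and the real human G-CSF sequence has to be taken from the cited reference.

```python
# Substitutions defining ND28, as listed in the text
# (1-based positions in the human G-CSF amino acid sequence).
ND28_SUBSTITUTIONS = {1: "A", 3: "T", 4: "Y", 5: "R", 17: "S"}


def apply_substitutions(sequence, substitutions):
    """Return a copy of `sequence` with the given 1-based positions replaced."""
    residues = list(sequence)
    for position, new_residue in substitutions.items():
        if not 1 <= position <= len(residues):
            raise ValueError(f"position {position} is outside the sequence")
        residues[position - 1] = new_residue
    return "".join(residues)


# Placeholder only (assumption): 174 unknown residues ("X") stand in for
# the real human G-CSF sequence, which is not reproduced here.
placeholder_gcsf = "X" * 174
nd28_like = apply_substitutions(placeholder_gcsf, ND28_SUBSTITUTIONS)
print(nd28_like[:20])
```

With a real sequence in place of the placeholder, the same table yields the ND28 sequence without any change in length.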

1. Preparation of TPO gene 

A TPO-encoding gene (hereinafter referred to as "TPO 
gene") for use in the preparation of TPO-ND28 was obtained by 
PCR in the following manner on the basis of the nucleotide 
sequence reported by de Sauvage et al. [Nature, 369, 533 
(1994)]. 

A DNA shown in Sequence ID No. 7 containing the 5' end 
nucleotide sequence of the TPO gene (hereinafter referred to as 
"primer 1") and a DNA shown in Sequence ID No. 8 containing the 3' 
end nucleotide sequence of the TPO gene (hereinafter referred 
to as "primer 2") were synthesized using a 380A DNA synthesizer 
of Applied Biosystems, Inc. In order to facilitate the 
cloning, a restriction enzyme recognition sequence was added to 
the terminus of each primer. 

Amplification and cloning of the TPO gene translation 
region sequence were carried out by reverse transcription PCR 
using the primers 1 and 2, human liver poly A+ mRNA 
(manufactured by Clontech Co., product No. CL 6510-1) and 
SuperScript Preamplification System for First Strand cDNA 
Synthesis Kit (manufactured by GibcoBRL Co.). 

A 0.013 ml portion of aqueous solution containing 1,000 
ng of human liver poly A+ mRNA and 500 ng of oligo(dT)12-18 
(included in the kit) was treated at 70°C for 10 minutes and 
then allowed to stand in ice for 1 minute. 

The resulting solution was mixed with 0.002 ml of ten 
times-concentrated synthesis buffer, 0.001 ml of 10 mM dNTP 
mix, 0.002 ml of 0.1 M DTT and 0.001 ml of SuperScript II RT 
(200 kU/ml) (all included in the kit), and the mixture was 
allowed to stand at room temperature for 10 minutes and then 
incubated at 42°C for 50 minutes. After completion of the 
incubation, the mixture was heated at 90°C for 5 minutes to 
terminate the reverse transcription reaction. 
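
As a rough check of the first-strand reaction described above, the total volume and the final concentrations can be worked out from the listed volumes. The Python sketch below uses only the figures given in the text; the variable names are illustrative assumptions, and any component not listed in the text is deliberately left out.

```python
# Volumes (ml) of the first-strand reaction components as listed in the text.
components_ml = {
    "mRNA + oligo(dT) solution": 0.013,
    "10x synthesis buffer": 0.002,
    "10 mM dNTP mix": 0.001,
    "0.1 M DTT": 0.002,
    "SuperScript II RT": 0.001,
}

total_ml = sum(components_ml.values())

# Final concentration = stock concentration x (volume added / total volume).
dntp_final_mM = 10.0 * components_ml["10 mM dNTP mix"] / total_ml
dtt_final_mM = 100.0 * components_ml["0.1 M DTT"] / total_ml

# 200 kU/ml is 200,000 U/ml, so 0.001 ml carries 200 units.
rt_units = 200_000 * components_ml["SuperScript II RT"]

print(f"total volume: {total_ml * 1000:.0f} microliters")
print(f"dNTP: {dntp_final_mM:.2f} mM, DTT: {dtt_final_mM:.1f} mM")
print(f"reverse transcriptase: {rt_units:.0f} units")
```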

The reaction solution was mixed with 0.001 ml of E. 
coli RNase H (2,000 U/ml; included in the kit) and incubated at 
37°C for 20 minutes. 

A 0.1 ml portion of a reaction solution containing 
0.005 ml of the above reaction solution, 400 nM of the primer 
1, 400 nM of the primer 2, 20 mM of Tris-HCl (pH 8.2), 10 mM of 
potassium chloride, 0.01 mg/ml of bovine serum albumin 
(hereinafter referred to as "BSA"), 2 mM of magnesium chloride, 
6 mM of ammonium sulfate, 0.1% Triton X-100, 10% dimethyl 
sulfoxide (hereinafter referred to as "DMSO"), 0.05 mM of 
deoxyadenosine triphosphate (hereinafter referred to as 
"dATP"), 0.05 mM of deoxycytidine triphosphate (hereinafter 
referred to as "dCTP"), 0.05 mM of deoxyguanosine triphosphate 
(hereinafter referred to as "dGTP") and 0.05 mM of 
deoxythymidine triphosphate (hereinafter referred to as "dTTP") 
was mixed with 2.5 units of Pfu polymerase (manufactured by 
Stratagene Co.) to carry out PCR using PERKIN ELMER CETUS DNA 
Thermal Cycler (manufactured by Takara Shuzo Co., Ltd.) by 
repeating 35 times a three-step incubation at 94°C for 45 
seconds, at 50°C for 1 minute and at 72°C for 2 minutes. 
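
The absolute amounts in this PCR and the length of the cycling program follow directly from the stated concentrations and times. The Python sketch below is a simple worked calculation under the assumption that ramp times between temperatures are ignored; the variable names are illustrative.

```python
# Reaction volume and concentrations as stated in the text.
reaction_volume_ml = 0.1
primer_nM = 400
dntp_each_mM = 0.05

# nM x ml = (nmol/L) x (1e-3 L) = picomoles
primer_pmol = primer_nM * reaction_volume_ml
# mM x ml = micromoles; multiply by 1000 to express in nanomoles
dntp_each_nmol = dntp_each_mM * reaction_volume_ml * 1000

# Three-step cycle repeated 35 times (hold times only, ramps ignored).
cycles = 35
step_seconds = {"94C": 45, "50C": 60, "72C": 120}
total_seconds = cycles * sum(step_seconds.values())

print(f"each primer: {primer_pmol:.0f} pmol")
print(f"each dNTP: {dntp_each_nmol:.0f} nmol")
print(f"hold time: {total_seconds} s ({total_seconds / 60:.2f} min)")
```

The later 18-cycle reactions with a 1 minute extension can be checked the same way by changing cycles and the 72C entry.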

The resulting reaction solution was subjected to 
phenol/chloroform extraction and ethanol precipitation, and the 
thus obtained precipitate was dissolved in 0.015 ml of TE 
buffer [10 mM Tris-HCl (pH 8.0) and 1 mM 
ethylenediaminetetraacetic acid (hereinafter referred to as 
"EDTA")]. 

The thus prepared solution was mixed with restriction 
enzymes HindIII and KpnI to cleave the DNA amplified by PCR. 

The resulting solution was subjected to an agarose gel 
electrophoresis, and a HindIII-KpnI treated DNA of about 1.1 kb 
was isolated from the agarose gel. 

Using DNA Ligation Kit Ver. 1 (manufactured by Takara 
Shuzo Co., Ltd.), the thus isolated DNA (50 ng) was ligated 
with a HindIII-KpnI cleaved 2.9 kb fragment (30 ng) of a 
plasmid vector pBlueScript II SK(-) having a multicloning site 
(manufactured by Stratagene Co.) (volume of the reaction 
solution: 0.018 ml). 
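
The insert-to-vector molar ratio of such a ligation can be estimated from the masses and fragment lengths. The Python sketch below assumes that the mass of double-stranded DNA is proportional to its length and takes the insert as about 1.1 kb; the function name relative_moles is an illustrative assumption.

```python
def relative_moles(mass_ng, length_kb):
    """Moles of a DNA fragment in arbitrary units (mass divided by length)."""
    return mass_ng / length_kb


# Figures from the text: 50 ng of insert and 30 ng of the 2.9 kb vector
# fragment. The insert length of about 1.1 kb is approximate.
insert_moles = relative_moles(50, 1.1)
vector_moles = relative_moles(30, 2.9)

molar_ratio = insert_moles / vector_moles
print(f"insert:vector molar ratio is roughly {molar_ratio:.1f}:1")
```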

Using this reaction solution, an Escherichia coli 
strain DH5α (Library Efficiency DH5α Competent Cells, 
manufactured by GibcoBRL Co.) was transformed in the usual way, 
and the resulting transformant was spread on LB agar medium 
containing 50 µg/ml of ampicillin and cultivated overnight at 
37°C. 

Plasmids were isolated from several transformant 
strains grown on the medium in accordance with a known method 
[Birnboim et al., Nucleic Acids Res., 7, 1513 (1979)]. 

Nucleotide sequence of the insertion fragment in each 
plasmid was determined using Taq DyeDeoxy Terminator Cycle 
Sequencing Kit (manufactured by Applied Biosystems Japan Inc., 
product No. 401113) and ABI373A DNA Sequencer (manufactured by 
Applied Biosystems Japan Inc.). In determining the nucleotide 
sequence, six DNAs having the nucleotide sequences of Sequence 
ID Nos. 9 to 14, synthesized based on the nucleotide sequence 
of the TPO gene [de Sauvage et al., Nature, 369, 533 (1994)], 
and two DNAs having the nucleotide sequence shown in Sequence 
ID No. 15 or 16, each containing a nucleotide sequence in the 
vector, were used as primers for the nucleotide 
sequence determination. 

Determination of nucleotide sequence was carried out in 
accordance with the instructions attached to the kit and 
apparatus. 

Of the above-mentioned plasmids, plasmid pBS-TPO332, in 
which the nucleotide sequence of the insertion fragment 
coincided with the reported nucleotide sequence of the TPO 
gene, was used in the subsequent procedures. 

2. Construction and expression of DNA which codes for TPO-ND28 

Using the TPO-encoding DNA obtained in Example 1-1 and 
the ND28-encoding DNA obtained by the method described in 
Japanese Published Unexamined Patent Application No. 267292/88, 
a fusion polypeptide of TPO and ND28 (TPO on the N-terminal 
side and ND28 on the C-terminal side), TPO-ND28, was prepared 
in the following manner. 

1) Construction of DNA (Sequence ID No. 5) which codes for TPO- 
ND28 (1) [Sequence ID No. 2; a type constructed through a 
linker (Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Arg; 
Sequence ID No. 17)] 

Though the mature type TPO comprises 332 amino acids, 
it is reported that a shortened protein consisting of the 153 
amino acids on its N-terminal side shows the same activity as 
the full-length TPO [de Sauvage et al., Nature, 369, 533 
(1994)]. Accordingly, a DNA which codes for TPO-ND28 (1), in 
which the 153 amino acids from the N-terminal of TPO, used as 
the N-terminal side, were fused with the full-length ND28 (174 
amino acids) as the C-terminal side through a linker (Gly Gly 
Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Arg), was prepared in 
the following manner (cf. Fig. 1). 
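
The size of this fusion follows from the lengths of its three parts. The Python sketch below converts the linker from three-letter to one-letter code and adds up the residues; it counts only the residues named in the text (no signal peptide or stop codon), and the names used are illustrative assumptions.

```python
# Three-letter to one-letter codes for the residues used in the linker.
THREE_TO_ONE = {"Gly": "G", "Ser": "S", "Arg": "R"}

linker_three_letter = "Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Arg"
linker = "".join(THREE_TO_ONE[res] for res in linker_three_letter.split())

# Lengths stated in the text.
TPO_FRAGMENT_LENGTH = 153   # N-terminal portion of TPO
ND28_LENGTH = 174           # full-length ND28

fusion_length = TPO_FRAGMENT_LENGTH + len(linker) + ND28_LENGTH
coding_bp = fusion_length * 3  # three nucleotides per amino acid

print(linker, len(linker))
print(f"fusion: {fusion_length} residues, {coding_bp} bp of coding sequence")
```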

(i) Preparation of DNA which codes for the TPO moiety of TPO- 
ND28 (1) 

In order to prepare a DNA which codes for the TPO 
moiety of TPO-ND28 (1) by means of PCR, a DNA primer having a 
nucleotide sequence (Sequence ID No. 18) which corresponds to 
the linker was synthesized as the 3' end primer (hereinafter 
referred to as "primer 3"). 

Using the thus synthesized primer 3 and the primer 1 
and pBS-TPO332, PCR was carried out in the following manner. 

A 0.1 ml portion of a reaction solution containing 10 
ng of pBS-TPO332, 400 nM of the primer 3, 400 nM of the primer 
1, 20 mM of Tris-HCl (pH 8.2), 10 mM of potassium chloride, 
0.01 mg/ml of BSA, 2 mM of magnesium chloride, 6 mM of ammonium 
sulfate, 0.1% Triton X-100, 10% DMSO, 0.05 mM of dATP, 0.05 mM 
of dCTP, 0.05 mM of dGTP and 0.05 mM of dTTP was mixed with 2.5 
units of Pfu polymerase to carry out PCR using PERKIN ELMER 
CETUS DNA Thermal Cycler (manufactured by Takara Shuzo Co., 
Ltd.) by repeating 18 times a three-step incubation at 94°C 
for 45 seconds, at 50°C for 1 minute and at 72°C for 1 minute. 

The resulting reaction solution was subjected to 
phenol/chloroform extraction and ethanol precipitation, and the 
thus obtained precipitate was dissolved in 0.015 ml of TE 
buffer. 

The thus prepared solution was mixed with restriction 
enzymes HindIII and XbaI to cleave the DNA amplified by PCR. 

The resulting solution was subjected to an agarose gel 
electrophoresis, and a HindIII-XbaI treated DNA fragment of 
about 0.6 kb was isolated from the agarose gel. 

Using DNA Ligation Kit Ver. 1 (manufactured by Takara 
Shuzo Co., Ltd.), the thus isolated DNA fragment (100 ng) was 
ligated with a HindIII-XbaI cleaved 2.9 kb fragment (50 ng) of 
pBlueScript II SK(-) (volume of the reaction solution: 0.018 
ml). 

Using this reaction solution, the Escherichia coli 
strain DH5α was transformed in the usual way, and the resulting 
transformant was spread on LB agar medium containing 50 µg/ml 
of ampicillin and cultivated overnight at 37°C. 

Plasmids were isolated from several transformant 
strains grown on the medium in accordance with a known method. 

Nucleotide sequence of the insertion fragment in each 
plasmid was determined using Taq DyeDeoxy Terminator Cycle 
Sequencing Kit and ABI373A DNA Sequencer (manufactured by 
Applied Biosystems Japan Inc.). In determining the nucleotide 
sequence, primers having the nucleotide sequences of Sequence 
ID Nos. 9 to 12, 15 and 16 were used as primers for the 
nucleotide sequence determination. 

Determination of nucleotide sequence was carried out in 
accordance with the instructions attached to the kit and 
apparatus. 

Of the above-mentioned plasmids, plasmid pBS-T153LND, in 
which the nucleotide sequence of the insertion fragment 
coincided with the reported nucleotide sequence of the TPO 
gene, was used in the subsequent procedures. 

(ii) Preparation of DNA which codes for the ND28 moiety of TPO- 
ND28 (1) 

In order to prepare a DNA which codes for the ND28 
moiety of TPO-ND28 (1) by means of PCR, a primer having a 
nucleotide sequence (Sequence ID No. 19) which corresponds to 
the linker and the amino acid sequence of ND28 was synthesized 
as the 5' end primer (hereinafter referred to as "primer 4"), 
and a primer having a nucleotide sequence (Sequence ID No. 20) 
which corresponds to the C-terminal side amino acid sequence of 
ND28 was synthesized as the 3' end primer (hereinafter referred 
to as "primer 5"). 

Using the thus synthesized primers and plasmid pCfBD28 
(Japanese Published Unexamined Patent Application No. 
267292/88), PCR was carried out in the following manner. 

A 0.1 ml portion of a reaction solution containing 10 
ng of pCfBD28, 400 nM of the primer 4, 400 nM of the primer 5, 
20 mM of Tris-HCl (pH 8.2), 10 mM of potassium chloride, 0.01 
mg/ml of BSA, 2 mM of magnesium chloride, 6 mM of ammonium 
sulfate, 0.1% Triton X-100, 10% DMSO, 0.05 mM of dATP, 0.05 mM 
of dCTP, 0.05 mM of dGTP and 0.05 mM of dTTP was mixed with 2.5 
units of Pfu polymerase to carry out PCR using PERKIN ELMER 
CETUS DNA Thermal Cycler (manufactured by Takara Shuzo Co., 
Ltd.) by repeating 18 times a three-step incubation at 94°C 
for 45 seconds, at 50°C for 1 minute and at 72°C for 1 minute. 

The resulting reaction solution was subjected to 
phenol/chloroform extraction and ethanol precipitation, and the 
thus obtained precipitate was dissolved in 0.015 ml of TE 
buffer. 

The thus prepared solution was mixed with restriction 
enzymes SacII and XbaI to cleave the DNA amplified by PCR. 

The resulting solution was subjected to an agarose gel 
electrophoresis, and a SacII-XbaI cleaved DNA fragment of about 
0.5 kb was isolated from the agarose gel. 

Using DNA Ligation Kit Ver. 1 (manufactured by Takara 
Shuzo Co., Ltd.), the thus isolated DNA fragment (100 ng) was 
ligated with a SacII-XbaI cleaved 2.9 kb fragment (50 ng) of 
pBlueScript II SK(-) (volume of the reaction solution: 0.018 
ml). 

Using this reaction solution, the Escherichia coli 
strain DH5α was transformed in the usual way, and the resulting 
transformant was spread on LB agar medium containing 50 µg/ml 
of ampicillin and cultivated overnight at 37°C. 

Plasmids were isolated from several transformant 
strains grown on the medium in accordance with a known method. 

Nucleotide sequence of the insertion fragment in each 
plasmid was determined using Taq DyeDeoxy Terminator Cycle 
Sequencing Kit and ABI373A DNA Sequencer. In determining the 
nucleotide sequence, two DNAs having the nucleotide sequence 
of Sequence ID No. 21 or 22 containing a nucleotide sequence of 
the ND28-encoding DNA and two DNAs having the nucleotide 
sequence of Sequence ID No. 15 or 16 containing a sequence 
present in the vector were used as primers for the nucleotide 
sequence determination. 

Determination of nucleotide sequence was carried out in 
accordance with the instructions attached to the kit and 
apparatus. 

Of the above-mentioned plasmids, plasmid pBS-LND28 in 
which the nucleotide sequence of the insertion fragment 
coincided with the nucleotide sequences of the ND28 gene and 
primers was used in the subsequent procedures. 

(iii) Preparation of DNA which codes for TPO-ND28 (1) 

The DNAs which respectively code for the TPO moiety 
and the ND28 moiety prepared in Example 1-2-1)-(i) and (ii) were 
fused in the following manner. 

A 2,000 ng portion of pBS-T153LND was cleaved with 
restriction enzymes SacII and XbaI and subjected to an agarose 
gel electrophoresis to isolate a DNA fragment of about 3.5 kb. 

Also, a 500 ng portion of pBS-LND28 was cleaved with 
restriction enzymes SacII and XbaI and subjected to an agarose 
gel electrophoresis to isolate a DNA fragment of about 0.5 kb. 

Using DNA Ligation Kit Ver. 1 (manufactured by Takara 
Shuzo Co., Ltd.), the DNA fragment of about 3.5 kb (100 ng) was 
ligated with the DNA fragment of about 0.5 kb (100 ng) (volume 
of the reaction solution: 0.018 ml). 

Using this reaction solution, the Escherichia coli 
strain DH5α was transformed in the usual way, and the resulting 
transformant was spread on LB agar medium containing 50 µg/ml 
of ampicillin and cultivated overnight at 37°C. 

Plasmids were isolated from several transformant 
strains grown on the medium in accordance with a known method. 

Structures of these plasmids were examined using 
restriction enzymes SacII and XbaI, and plasmid pBS-T153LND28 
having a structure in which both of the DNA fragments are 
ligated with each other was used in the subsequent procedures. 

2) Construction of DNA (Sequence ID No. 4) which codes for TPO- 
ND28 (2) [Sequence ID No. 1; a type constructed without a 
linker] 

A DNA which codes for TPO-ND28 (2), in which the 154 
amino acids of TPO from its N-terminal were fused with the 
N-terminal of ND28 (174 amino acids), was prepared in the 
following manner (cf. Fig. 2). 

(i) Preparation of DNA which codes for the TPO moiety of TPO- 
ND28 (2) 

In order to prepare a DNA which codes for the TPO 
moiety of TPO-ND28 (2) by means of PCR, a primer having the 
nucleotide sequence shown in Sequence ID No. 23, which 
corresponds to the amino acid 
sequences of TPO and ND28, was synthesized as the 3' side primer 
(hereinafter referred to as "primer 6"). 

Using the thus synthesized primer 6 and the primer 1 
and pBS-TPO332, PCR was carried out in the following manner. 

A 0.1 ml portion of a reaction solution containing 10 
ng of pBS-TPO332, 400 nM of the primer 1, 400 nM of the primer 
6, 20 mM of Tris-HCl (pH 8.2), 10 mM of potassium chloride, 
0.01 mg/ml of BSA, 2 mM of magnesium chloride, 6 mM of ammonium 
sulfate, 0.1% Triton X-100, 10% DMSO, 0.05 mM of dATP, 0.05 mM 
of dCTP, 0.05 mM of dGTP and 0.05 mM of dTTP was mixed with 2.5 
units of Pfu polymerase to carry out PCR using PERKIN ELMER 
CETUS DNA Thermal Cycler by repeating 18 times a three-step 
incubation at 94°C for 45 seconds, at 50°C for 1 minute and at 
72°C for 1 minute. 

The resulting reaction solution was subjected to 
phenol/chloroform extraction and ethanol precipitation, and the 
thus obtained precipitate was dissolved in 0.015 ml of TE 
buffer. 

The thus prepared solution was mixed with restriction 
enzymes HindIII and XhoI to cleave the DNA amplified by PCR. 

The resulting solution was subjected to an agarose gel 
electrophoresis, and a HindIII-XhoI cleaved DNA fragment of 
about 0.5 kb was isolated from the agarose gel. 

Using DNA Ligation Kit Ver. 1 (manufactured by Takara 
Shuzo Co., Ltd.), the thus isolated DNA fragment (100 ng) was 
ligated with a HindIII-XhoI cleaved 2.9 kb fragment (50 ng) of 
pBlueScript II SK(-) (volume of the reaction solution: 0.018 
ml). 

Using this reaction solution, the Escherichia coli 
strain DH5α was transformed in the usual way, and the resulting 
transformant was spread on LB agar medium containing 50 µg/ml 
of ampicillin and cultivated overnight at 37°C. 

Plasmids were isolated from several transformant 
strains grown on the medium in accordance with a known method. 

Nucleotide sequence of the insertion fragment in each 
plasmid was determined using Taq DyeDeoxy Terminator Cycle 
Sequencing Kit and ABI373A DNA Sequencer (manufactured by 
Applied Biosystems Japan Inc.). In determining the nucleotide 
sequence, primers having the nucleotide sequences of Sequence 
ID Nos. 9 to 12, 15 and 16 were used as primers for the 
nucleotide sequence determination. 

Determination of nucleotide sequence was carried out in 
accordance with the instructions attached to the kit and 
apparatus. 

Of the above-mentioned plasmids, plasmid pBS-T154ND in 
which the nucleotide sequence of the insertion fragment 
coincided with the nucleotide sequences of the TPO gene and 
primers was used in the subsequent procedures. 

(ii) Preparation of DNA which codes for TPO-ND28 (2) 

The DNA which codes for the TPO moiety prepared in 
Example 1-2-2)-(i) and the DNA which codes for the ND28 moiety 
prepared in Example 1-2-1)-(ii) were fused in the following 
manner. 

A 200 ng portion of pBS-T154ND was cleaved with 
restriction enzymes KpnI and XhoI and subjected to agarose gel 
electrophoresis to isolate a DNA fragment of about 3.5 kb. 

Also, a 500 ng portion of pBS-LND28 was cleaved with 
restriction enzymes KpnI and XhoI and subjected to agarose gel 
electrophoresis to isolate a DNA fragment of about 0.5 kb. 

Using DNA Ligation Kit Ver. 1 (manufactured by Takara 
Shuzo Co., Ltd.), the DNA fragment of about 3.5 kb (100 ng) was 
ligated with the DNA fragment of about 0.5 kb (100 ng) (volume 
of the reaction solution: 0.018 ml). 

Using this reaction solution, the Escherichia coli 
strain DH5α was transformed in the usual way, and the resulting 
transformant was spread on LB agar medium containing 50 µg/ml 
of ampicillin and cultivated overnight at 37°C. 

Plasmids were isolated from several transformant 
strains grown on the medium in accordance with a known method. 

Structures of these plasmids were examined using 
restriction enzymes KpnI and XhoI, and plasmid pBS-T154ND28 
having a structure in which both of the DNA fragments are 
ligated with each other was used in the subsequent procedures. 

3) Construction of DNA (Sequence ID No. 6) which codes for TPO- 
ND28 (3) [Sequence ID No. 3; a type constructed through a 
linker (Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly 
Gly Gly Arg; Sequence ID No. 24)] 

A DNA which codes for TPO-ND28 (3), in which the 153 
amino acids from the N-terminal of TPO, used as the N-terminal 
side, were fused with the full-length ND28 (174 amino acids) 
as the C-terminal side through a linker (Ser Gly Gly Gly Ser 
Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Arg), was prepared 
in the following manner (cf. Fig. 3). 

In order to ligate the DNA which codes for the TPO 
moiety prepared in Example 1-2-2)-(i) with the DNA which codes 
for the ND28 moiety prepared in Example 1-2-1)-(ii) through a 
linker (Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly 
Gly Gly Arg), two DNAs shown in Sequence ID Nos. 25 and 26, 
having nucleotide sequences which correspond to the amino acid 
sequence of the linker and form SplI-BbeI complementary 
termini on both sides, were synthesized. 

A 0.02 ml portion of a solution containing 0.01 mM of 
the DNA shown in Sequence ID No. 25, 5 mM of ATP, 50 mM of 
Tris-HCl (pH 8.0), 10 mM of magnesium chloride and 5 mM of 
dithiothreitol was mixed with 10 units of T4 Polynucleotide 
Kinase (manufactured by Takara Shuzo Co., Ltd.), and the 
mixture was allowed to stand at 37°C for 30 minutes and then 
heated at 70°C for 3 minutes to obtain treating solution (1). 

The DNA shown in Sequence ID No. 26 was also treated in 
the same manner to obtain treating solution (2). 

Treating solution (1) was mixed with treating solution 
(2), and the mixture was incubated at 90°C for 5 minutes and 
then gradually cooled to 22°C over 3 hours to prepare 
double-stranded DNA. 
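
The amounts handled in the kinase and annealing steps can be worked out from the stated concentrations. The Python sketch below is a plain calculation with illustrative variable names; it assumes a linear temperature ramp during cooling and reads the starting temperature of the annealing step as 90 degrees C.

```python
# Kinase reaction as stated in the text: 0.02 ml containing 0.01 mM
# oligonucleotide and 5 mM ATP.
kinase_volume_ml = 0.02
oligo_mM = 0.01
atp_mM = 5.0

# mM x ml = micromoles; multiply by 1e6 to express in picomoles.
oligo_pmol = oligo_mM * kinase_volume_ml * 1e6
atp_pmol = atp_mM * kinase_volume_ml * 1e6
atp_excess = atp_pmol / oligo_pmol

# Annealing: cooled from 90 to 22 degrees C over 3 hours
# (assumption: linear ramp; start temperature read as 90 degrees C).
start_c, end_c, hours = 90.0, 22.0, 3.0
cooling_rate_c_per_min = (start_c - end_c) / (hours * 60)

print(f"oligonucleotide per reaction: {oligo_pmol:.0f} pmol")
print(f"ATP molar excess over oligonucleotide: {atp_excess:.0f}-fold")
print(f"average cooling rate: {cooling_rate_c_per_min:.2f} C/min")
```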

The thus prepared double-stranded DNA was inserted into 
the connecting site of the TPO-coding gene and ND28-coding gene 
of pBS-T154ND28 obtained in Example 1-2-2)-(ii) in the 
following manner. 

A 2,000 ng portion of pBS-T154ND28 was cleaved with 
restriction enzymes BbeI and SplI and subjected to an agarose 
gel electrophoresis to isolate a DNA fragment of about 4.0 kb. 

Using DNA Ligation Kit Ver. 1 (manufactured by Takara 
Shuzo Co., Ltd.), the DNA fragment of about 4.0 kb (100 ng) was 
ligated with the above-mentioned double-stranded DNA (12.5 
pmole) (volume of the reaction solution: 0.018 ml). 

Using this reaction solution, the Escherichia coli 
strain DH5α was transformed in the usual way, and the resulting 
transformant was spread on LB agar medium containing 50 µg/ml 
of ampicillin and cultivated overnight at 37°C. 

Plasmids were isolated from several transformant 
strains grown on the medium in accordance with a known method. 

Nucleotide sequence of the insertion fragment in each 
plasmid was determined using Taq DyeDeoxy Terminator Cycle 
Sequencing Kit and ABI373A DNA Sequencer. In determining the 
nucleotide sequence, two DNAs shown in Sequence ID Nos. 12 and 
22 were used as primers. Determination of nucleotide sequence 
was carried out in accordance with the instructions attached to 
the kit and apparatus. 

Of these plasmids, the plasmid named pBS-T153ND28LNl, in 
which the nucleotide sequence of the insertion fragment 
coincided with the nucleotide sequence of the linker DNA, was 
used in the subsequent procedures. 

Example 2 Production of TPO-CSF 

The TPO-CSF was produced by effecting expression of the 
DNA which codes for the TPO-CSF in animal cells in the 
following manner. 

1) Production of TPO-ND28 (1) and TPO-ND28 (2) 

Plasmid pcDNA3 (manufactured by Invitrogen Co.) was 
cleaved with EcoRI and NotI and subjected to an agarose gel 
electrophoresis to isolate a DNA fragment (vector side) of 
about 5.4 kb. 

Also, pBS-T153LND28 and pBS-T154ND28 obtained in 
Example 1-2-1)-(iii) and Example 1-2-2)-(ii) were separately 
cleaved with EcoRI and NotI and subjected to agarose gel 
electrophoresis to isolate a DNA fragment (insert side) of 
about 1.1 kb from each plasmid. 

Using DNA Ligation Kit Ver. 1, the vector side DNA 
fragment of about 5.4 kb (100 ng) was ligated with each of the
insert side DNA fragments (100 ng) (volume of the reaction 
solution: 0.018 ml). 
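
As a rough check on the ligation stoichiometry described above, the following sketch converts the stated masses (100 ng each) and approximate fragment lengths (about 5.4 kb and about 1.1 kb) into molar amounts. The average mass of 650 g/mol per base pair and the helper name `dsdna_pmol` are assumptions introduced for illustration; they are not part of the described procedure.

```python
# Assumed average molar mass of one base pair of double-stranded DNA (g/mol).
AVG_BP_MASS = 650.0

def dsdna_pmol(mass_ng: float, length_bp: int) -> float:
    """Convert a mass of double-stranded DNA (ng) into picomoles of molecules."""
    grams = mass_ng * 1e-9
    moles = grams / (length_bp * AVG_BP_MASS)
    return moles * 1e12

vector_pmol = dsdna_pmol(100, 5400)   # vector side fragment, about 5.4 kb
insert_pmol = dsdna_pmol(100, 1100)   # insert side fragment, about 1.1 kb
molar_ratio = insert_pmol / vector_pmol

print(f"vector: {vector_pmol:.3f} pmol")
print(f"insert: {insert_pmol:.3f} pmol")
print(f"insert:vector molar ratio = {molar_ratio:.1f}")
```

Because equal masses are used, the molar ratio depends only on the fragment lengths (5.4 kb / 1.1 kb), so the shorter insert is present in roughly five-fold molar excess under these assumptions.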

Using this reaction solution, the Escherichia coli
strain DH5α was transformed in the usual way, and the resulting
transformant was spread on LB agar medium containing 50 µg/ml
of ampicillin and cultivated overnight at 37°C.

Plasmids were isolated from several transformant 
strains grown on the medium in accordance with a known method.

Structure of each plasmid was examined using 
restriction enzymes EcoR I and Not I to select plasmids 
containing respective inserts having a structure in which the 
vector side and insert side DNA fragments are ligated with each 
other, and plasmid pCD-153LND28 containing a TPO-ND28 (1) 
encoding gene and plasmid pCD-154ND28 containing a TPO-ND28 (2) 
encoding gene were used in the subsequent procedure.

Plasmid pCD-153LND28 or pCD-154ND28 was introduced into 
animal cells by electroporation [Potter et al., Proc. Natl.
Acad. Sci. USA, 81, 7161 (1984)] and its expression was
effected in the following manner.

COS 7 cells were cultivated in D-MEM medium 
(manufactured by GibcoBRL Co., product No. 11885-50) which was 
further supplemented with 10% fetal bovine serum. 

The COS 7 cells obtained by cultivation were suspended
in K-PBS buffer (137 mM potassium chloride, 2.7 mM sodium
chloride, 8.1 mM disodium hydrogenphosphate, 1.5 mM sodium
dihydrogenphosphate, 4 mM magnesium chloride) to prepare a cell
suspension of 8×10^6 cells/ml.

A 0.2 ml portion of the cell suspension was injected 
into a Pulser Cuvette (manufactured by BIO RAD LABORATORIES) 
having a slit width of 0.2 cm. 

A 4 µg portion of pCD-153LND28 or pCD-154ND28 was added
to the cuvette, thoroughly mixed with the suspension and then
subjected to pulse application using an electroporation
apparatus (Gene Pulser, manufactured by BIO RAD LABORATORIES)
under conditions of 200 Ω, 0.3 kV/cm and 0.125 mF.
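
The quantities loaded into each cuvette follow directly from the figures given above. The sketch below works them out, taking the cell density as 8×10^6 cells/ml; the variable names are illustrative only.

```python
cells_per_ml = 8e6   # density of the cell suspension (cells/ml)
volume_ml = 0.2      # portion of suspension placed in the cuvette (ml)
plasmid_ug = 4.0     # plasmid DNA added to the cuvette (µg)

# Total cells exposed to the pulse, and DNA available per million cells.
cells_per_cuvette = cells_per_ml * volume_ml
ug_per_million_cells = plasmid_ug / (cells_per_cuvette / 1e6)

print(f"cells per cuvette: {cells_per_cuvette:.2e}")
print(f"plasmid per 10^6 cells: {ug_per_million_cells:.2f} µg")
```

Under these figures each pulse treats about 1.6×10^6 cells with 2.5 µg of plasmid per million cells.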

The pulse-treated solution was allowed to stand in ice 
for 5 minutes, suspended in 10 ml of D-MEM medium supplemented 
with 10% fetal bovine serum and then cultivated at 37 °C for 72 
hours in a CO2 incubator.

The culture broth was subjected to centrifugation, and
the resulting culture supernatant was filtered through a filter
of 220 nm pore size to obtain a solution of TPO-ND28 (1) or
TPO-ND28 (2).

2) Production of TPO-ND28 (3)

Plasmid pAGE210 was used as the vector for the
expression of TPO-ND28 (3). The vector pAGE210 is a derivative
of pAGE248 [Sasaki et al., J. Biol. Chem., 269, 14730 (1994)],
in which the Moloney murine leukemia virus promoter
(XhoI-HindIII fragment) has been replaced by the SV40 early
promoter (XhoI-HindIII fragment) of pAGE103 [Mizukami et al.,
J. Biochem., 101, 1307 (1987)].

Plasmid pAGE210 was cleaved with KpnI and HindIII and
subjected to agarose gel electrophoresis to isolate a DNA
fragment (vector side) of about 9.0 kb.

Separately from this, pBS-TPO322 obtained in Example 1-1
was cleaved with KpnI and HindIII, and pBS-153ND28LN1
obtained in Example 1-2-3) was cleaved with KpnI and then
partially with HindIII, and each of the resulting cleaved
fragments was subjected to agarose gel electrophoresis to
isolate a DNA fragment (insert side) of about 1.1 kb from each
plasmid.

Using DNA Ligation Kit Ver. 1, the vector side DNA 
fragment of about 9.0 kb (100 ng) was ligated with each of the 
insert side DNA fragments of about 1.1 kb (100 ng) (volume of
the reaction solution: 0.012 ml). 

Using this reaction solution, the Escherichia coli
strain DH5α was transformed in the usual way, and the resulting
transformant was spread on LB agar medium containing 50 µg/ml
of ampicillin and cultivated overnight at 37°C.

Plasmids were isolated from several transformant
strains grown on the medium in accordance with a known method.

The structure of each plasmid was examined using the
restriction enzyme KpnI to select plasmids containing
respective inserts having a structure in which the vector side
and insert side DNA fragments are ligated with each other, and
plasmid pAGE210-T332 containing the TPO encoding gene and plasmid
pAGE210-LN1 containing the TPO-ND28 (3) encoding gene were used in
the subsequent procedure.

Plasmid pAGE210-T332 or pAGE210-LN1 was introduced into
animal cells by electroporation.

CHO cells were cultivated in MEM medium (1)
(manufactured by GibcoBRL Co., product No. 19000-024) which was
further supplemented with 10% fetal bovine serum.

The CHO cells obtained by cultivation were suspended in 
K-PBS buffer to prepare a cell suspension of 8×10^6 cells/ml.

A 0.2 ml portion of the cell suspension was injected 
into Pulser Cuvette having a slit width of 0.2 cm. 

A 4 µg portion of pAGE210-T332 or pAGE210-LN1 was added
to the cuvette, thoroughly mixed with the suspension and then
subjected to pulse application using an electroporation
apparatus, Gene Pulser, under conditions of 0.35 kV/cm and 0.25
mF.

The pulse-treated solution was allowed to stand in ice 
for 5 minutes, suspended in 10 ml of MEM medium supplemented
with 10% fetal bovine serum and then cultivated at 37°C for 24
hours in a CO2 incubator.

The thus cultivated cells were again cultivated for 2 
weeks in MEM medium (1) supplemented with 10% fetal bovine 
serum and 0.3 mg/ml of hygromycin. 

The resulting cells were further cultivated for 2 weeks 
in MEM medium (2) (manufactured by GibcoBRL Co., code No. 
12000-022) supplemented with 10% fetal bovine serum and 50 nM 
methotrexate (hereinafter referred to as MTX). 

The cultivation was repeated in the same manner by 
successively increasing the MTX concentration to 100 nM, 500 nM 
and 1,000 nM in that order, thereby obtaining strains resistant 
to 1,000 nM MTX.

Each of the 1,000 nM MTX resistant strains was grown in 
MEM medium (2) supplemented with 10% fetal bovine serum, the 
medium was exchanged with a serum-free medium for CHO cell use, 
CHO-S-SFMII (manufactured by GibcoBRL Co., code No. 12052-015),
and then the strain was cultivated again for 96 to 144 hours. 

By subjecting the culture broth to centrifugation, a
culture supernatant containing TPO or TPO-ND28 (3) was
obtained.

Example 3 Purification of TPO-ND28 (3) and TPO

A 1,000 ml portion of TPO-ND28 (3) or TPO obtained in 
Example 2-2) was concentrated to 50 ml using Centriprep
(manufactured by Amicon Co.) to prepare a concentrated
solution.

A 50 ml portion of each of the concentrated solutions
was applied to an XK50 column (manufactured by Pharmacia K.K.)
which had been packed with 1,000 ml of Sephacryl S-200 resin
(manufactured by Pharmacia K.K.) and filled with a phosphate
buffer (9.4 mM sodium phosphate (pH 7.2), 137 mM NaCl, 2.7 mM
KCl).

Elution of TPO-ND28 (3) or TPO was effected by passing 
the phosphate buffer through the column at a flow rate of 3 
ml/minute.

The eluate was collected in fractions every 12.5 minutes, and the
resulting fractions were checked for their TPO and G-CSF
activities by an MTT assay method which will be described
later, thereby obtaining purified TPO-ND28 (3) or TPO.
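
The fraction size implied by the stated flow rate and collection interval can be worked out as follows. This is a minimal sketch; the function name is illustrative, and relating the fraction size to the 1,000 ml of packed resin is only an orientation aid, not a statement about where the product elutes.

```python
def fraction_volume_ml(flow_ml_per_min: float, interval_min: float) -> float:
    """Volume collected in one fraction at a constant flow rate."""
    return flow_ml_per_min * interval_min

flow = 3.0          # ml/minute, as stated for the Sephacryl S-200 column
interval = 12.5     # minutes per fraction
resin_ml = 1000.0   # volume of packed resin

fraction_ml = fraction_volume_ml(flow, interval)
fractions_per_bed_volume = resin_ml / fraction_ml

print(f"fraction volume: {fraction_ml} ml")
print(f"fractions per bed volume: {fractions_per_bed_volume:.1f}")
```

Each fraction is therefore 37.5 ml, and one bed volume corresponds to roughly 27 fractions.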

Example 4 Modification of TPO-ND28 (3) with polyethylene glycol

To ice-cooled water was added 20 kd PEG-succinimidyl 
propionate (manufactured by Shearwater Polymers Co.) to a final
concentration of 400 mg/ml. 

A 50 µl portion of the thus prepared aqueous solution
was mixed with 200 µl of the TPO-ND28 (3) solution obtained in
Example 3 and 150 µl of distilled water. The mixture was
allowed to stand for 12 hours at 4°C, thereby effecting
modification of TPO-ND28 (3) by polyethylene glycol.

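The composition of the PEG reaction mixture described in this example can be checked with a short calculation. The sketch below assumes a nominal molar mass of 20,000 g/mol for the 20 kd PEG reagent; the variable names are illustrative.

```python
peg_stock_mg_per_ml = 400.0   # PEG-succinimidyl propionate stock solution
peg_ul = 50.0                 # volume of PEG stock added
protein_ul = 200.0            # volume of TPO-ND28 (3) solution
water_ul = 150.0              # volume of distilled water
peg_molar_mass = 20000.0      # g/mol, nominal value assumed for 20 kd PEG

total_ul = peg_ul + protein_ul + water_ul
final_peg_mg_per_ml = peg_stock_mg_per_ml * peg_ul / total_ul
# mg/ml equals g/l, so g/l divided by g/mol gives mol/l; convert to mM.
final_peg_mM = final_peg_mg_per_ml / peg_molar_mass * 1000
protein_dilution = total_ul / protein_ul

print(f"total volume: {total_ul} µl")
print(f"final PEG: {final_peg_mg_per_ml} mg/ml ({final_peg_mM} mM)")
print(f"protein solution diluted {protein_dilution}-fold")
```

Under these assumptions the 400 µl mixture contains 50 mg/ml (about 2.5 mM) of the PEG reagent, and the protein solution is diluted two-fold.
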
The TPO-ND28 (3) thus modified with polyethylene glycol
(hereinafter referred to as PEG-TPO-ND28 (3)) was applied to a
column of Superose 6 10/30 (manufactured by Pharmacia K.K.)
which had been filled in advance with a phosphate buffer (9.4
mM sodium phosphate (pH 7.2), 137 mM NaCl, 2.7 mM KCl).

Elution was effected by passing the phosphate buffer 
through the column at a flow rate of 0.5 ml/minute. 

The eluates were pooled for every 1 minute, and the 
resulting fractions were checked for their G-CSF and TPO 
activities by the MTT assay method which will be described later.

The results are shown in Table 5. 

The G-CSF and TPO activities originating from unmodified
TPO-ND28 (3) were detected 34 to 40 minutes after



TABLE 5 

Elution time (minutes) 0 10 14 16 18 20 22 24 26 28 30 32 34 36 38 40

G-CSF activity - + + + + + + + + + + 

TPO activity + + + + + + + + + + + 

-: no activity; +: activity 

Test Example 1 Measurement of TPO-ND28 molecular weight 

Using the TPO-ND28 (1) solution obtained in Example 2-1),
its molecular weight was measured by gel filtration
chromatography in the following manner.

A 0.2 ml portion of the TPO-ND28 (1) solution was
applied to a column of Superose 6 10/30 (manufactured by
Pharmacia K.K.) which had been equilibrated in advance with a
phosphate buffer (9.4 mM sodium phosphate (pH 7.2), 137 mM
NaCl, 2.7 mM KCl), and elution of TPO-ND28 (1) was effected by
passing the phosphate buffer through the column at a flow rate
of 0.5 ml/minute.

The eluates were pooled for every 0.5 minute, and the 
resulting fractions were checked for their TPO and G-CSF
activities by an MTT assay method which will be described 
later. 

Table 6 shows the elution time from Superose and the measured
values of TPO and G-CSF activities.

The TPO and G-CSF activities reached the maximum after
33.5 minutes of elution.

Separately from this, thyroglobulin (molecular weight:
670,000), aldolase (molecular weight: 160,000), bovine serum
albumin (molecular weight: 69,000) and G-CSF (molecular weight:
20,000) were used as the standard molecular weight proteins and
passed through Superose to obtain the relationship between
elution time and molecular weight.

The molecular weight of TPO-ND28 (1) deduced from the elution
time of 33.5 minutes was about 40,000.

TABLE 6

Elution time (minutes)    0     20    30    32    33    33.5  34    35    37    42
TPO activity (A540)       0.08  0.07  0.08  0.19  0.30  0.32  0.29  0.18  0.09  0.08
G-CSF activity (A540)     0.00  0.00  0.03  0.14  0.29  0.32  0.30  0.22  0.05  0.00
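
The co-elution of the two activities can be confirmed directly from the values in Table 6. The sketch below locates the elution time of maximum absorbance for each activity; the helper name `peak_time` is illustrative, and the third TPO value is read as 0.08.

```python
elution_min = [0, 20, 30, 32, 33, 33.5, 34, 35, 37, 42]
tpo_a540 = [0.08, 0.07, 0.08, 0.19, 0.30, 0.32, 0.29, 0.18, 0.09, 0.08]
gcsf_a540 = [0.00, 0.00, 0.03, 0.14, 0.29, 0.32, 0.30, 0.22, 0.05, 0.00]

def peak_time(times, values):
    """Return the elution time at which the measured activity is highest."""
    idx = max(range(len(values)), key=values.__getitem__)
    return times[idx]

tpo_peak = peak_time(elution_min, tpo_a540)
gcsf_peak = peak_time(elution_min, gcsf_a540)

print(f"TPO activity peak:   {tpo_peak} minutes")
print(f"G-CSF activity peak: {gcsf_peak} minutes")
```

Both activities peak in the same fraction (33.5 minutes), which is what would be expected if a single molecule carries both activities.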

Test Example 2 Biological activity of TPO-CSF 

The basic procedure for measuring the cell
growth-stimulating activity of a solution to be tested
(TPO-ND28 solution) upon the cells to be tested is as follows.

Each solution to be tested (TPO-ND28 solution), the TPO
standard solution and the ND28 standard solution are each made
into 10-fold serial dilutions, and a 0.01 ml portion of each of
the dilutions is added to each well of a microtiter plate.

Actively growing cells to be tested are collected from
a culture broth by centrifugation, washed and then re-suspended
in a medium for testing use to the most suitable cell density for
each test.

The thus prepared cell suspension is dispensed in 0.09
ml portions into wells of the above-mentioned microtiter plate
which has been prepared by dispensing dilutions of the solution
to be tested, TPO standard solution or ND28 standard solution
in 0.01 ml portions.
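
Mixing 0.01 ml of each dilution with 0.09 ml of cell suspension dilutes the sample a further 10-fold in the well. The sketch below illustrates the resulting final dilution factors; the three-step series (10-, 100- and 1,000-fold) is a hypothetical example, since the number of dilution steps is not stated.

```python
def in_well_dilution(sample_ul: int = 10, cells_ul: int = 90) -> float:
    """Additional dilution caused by adding the cell suspension to the sample."""
    return (sample_ul + cells_ul) / sample_ul

# Hypothetical 10-fold serial dilutions prepared before plating.
tube_dilutions = [10 ** k for k in range(1, 4)]   # 10, 100, 1000

final_dilutions = [d * in_well_dilution() for d in tube_dilutions]

for tube, final in zip(tube_dilutions, final_dilutions):
    print(f"tube dilution 1:{tube} -> final dilution in well 1:{final:.0f}")
```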

The microtiter plate is incubated at 37°C in a
fully humidified 5% CO2 incubator and then used in the
following test.

A 0.01 ml portion of 0.5 mg/ml solution of MTT
[3-(4,5-dimethyl-2-thiazolyl)-2,5-diphenyltetrazolium bromide] is
added to each well, incubated for 4 hours, mixed with 0.15 ml
of 0.1 N hydrochloric acid/isopropyl alcohol solution and then
stirred to extract pigment from the cells, subsequently judging
growth of the cells by measuring the amount of the pigment by
its absorbance at 540 nm.

This method for the measurement of cell growth-stimulating
activity is hereinafter called the MTT assay.

(1) Measurement of cell growth-stimulating activity upon Ba/F3
cells

The Ba/F3 cells which grow depending on the presence of
mouse IL-3 were cultivated in Iscove's modified Dulbecco medium
(hereinafter referred to as "IMDM") which had been supplemented
with 10% heat-inactivated fetal calf serum (hereinafter
referred to as "FCS") and mouse IL-3 (culture supernatant of
WEHI-3B).

Using the thus cultivated Ba/F3 cells, the cell growth- 
stimulating activity was measured by the MTT assay using the 
just described medium but in the absence of mouse IL-3. 

The MTT assay was carried out with an inoculation 
density of 10,000 cells per well and by incubating the plate in
5% CO2 for 48 hours.

Results of the MTT assay showed that each of TPO, ND28 
and TPO-ND28 (1), (2) and (3) had no Ba/F3 cell growth- 
stimulating activity. 

(2) Measurement of cell growth-stimulating activity upon
Ba/F3-cmpl cells

The Ba/F3-cmpl cells which grow depending on the 
presence of mouse IL-3 or TPO were cultivated in IMDM which has 
been supplemented with 10% heat-inactivated FCS, 0.5 mg/ml of
G418 and mouse IL-3 (culture supernatant of WEHI-3B).

Using the thus cultivated Ba/F3-cmpl cells, the cell 
growth-stimulating activity was measured by the MTT assay using the
just described medium but in the absence of mouse IL-3. 

The MTT assay was carried out with an inoculation 
density of 10,000 cells per well and by incubating the plate in 
5% CO2 for 48 hours.

Results of the MTT assay showed that each of TPO and
TPO-ND28 (1), (2) and (3) had Ba/F3-cmpl cell growth-stimulating
activity.

(3) Measurement of cell growth-stimulating activity upon NFS-60
cells

The NFS-60 cells which grow depending on the presence
of human G-CSF or mouse IL-3 were cultivated in RPMI medium
which had been supplemented with 10% heat-inactivated FCS, 2 mM
glutamine, P/S (100 U/ml of penicillin, 100 mg/ml of
streptomycin) and 1.0 ng/ml of recombinant human G-CSF.

Using the thus cultivated NFS-60 cells, the cell 
growth-stimulating activity was measured by the MTT assay using 
the just described medium but in the absence of G-CSF.

The MTT assay was carried out with an inoculation 
density of 10,000 cells per well and by incubating the plate in 
5% CO2 for 48 hours.

Results of the MTT assay showed that each of ND28 and 
TPO-ND28 (1), (2) and (3) had NFS-60 cell growth-stimulating 
activity. 

Test Example 3 Effect of TPO-ND28 on mouse myeloid cells 

A BALB/c mouse of 8 weeks of age was sacrificed to
excise the femur and tibia, both ends of which were
subsequently cut with scissors. The needle of a syringe filled
with RPMI solution containing 10% FCS was inserted into the
cut end of the femur and tibia to flush out myeloid cells into a
small test tube, and the cells were allowed to stand for 5
minutes.

Using a Pasteur pipette, the supernatant fluid in the
test tube was drawn up taking care not to contaminate it with
the precipitate, and the supernatant fluid was overlaid on
Nycoprep 1.077 Animal (manufactured by NYCOMED Co., product No.
1002380) and subjected to 15 minutes of centrifugation at 600
g to isolate mouse mononuclear cells (hereinafter referred to
as "MNC").

The MNC were made into a suspension of 5×10^5 cells/ml
with a solution containing a solution to be tested, 10% FCS, 1%
BSA and 0.6 mg/ml of transferrin (manufactured by Boehringer
Mannheim Co.) and cultivated for 5 days in a CO2 incubator
(BNA-12QD, manufactured by TABAI Co.) under conditions of 37°C, 5%
CO2 and 95% or more of humidity.

As the solution to be tested, a solution of TPO, ND28
or TPO-ND28 having a final concentration of 1.0, 10 or 100
ng/ml or a solution in which the same volumes of TPO and ND28
solutions having the above-mentioned concentration were mixed
(TPO/ND28) was used. The TPO and ND28 obtained in Example 3
were used.

After completion of the cultivation, the state of
differentiation of MNC was examined by measuring the amount of
CD61 expressed, which is an index of differentiation into
the megakaryocyte system [J. Med., 311, 1084 (1984)], and the
amount of Gr-1 expressed, which is an index of differentiation
into the granulocyte system [J. Immunol., 144, 22 (1991)].

After staining with anti-mouse CD61-FITC monoclonal
antibody (manufactured by PHARMINGEN Co., product No. 01864D)
and anti-mouse Gr-1-PE monoclonal antibody (manufactured by
PHARMINGEN Co., product No. 01215A), expressed amounts of CD61
and Gr-1 were measured using an ELITE flow cytometer
(manufactured by Coulter Co.).

The results are shown in Table 7.

TABLE 7

Solution to be tested    Concentration    Expressed cells (%)
                         (ng/ml)          Gr-1      CD61

[first row illegible in source]
ND28                     1.0              49.1      7.6
                         10.0             40.7      4.9
                         100.0            44.5      4.6
TPO                      1.0              36.7      8.7
                         10.0             37.7      17.8
                         100.0            37.1      21.9
TPO/ND28                 1.0              50.7      10.3
                         10.0             40.6      10.4
                         100.0            49.2      5.7
TPO-ND28                 1.0              50.5      22.1
                         10.0             49.8      26.6
                         100.0            41.0      18.8

When the solution to be tested prepared by mixing the
same amounts of TPO and ND28 (TPO/ND28) was added, Gr-1
expressed cells were generated at a level similar to the case
of the addition of the solution to be tested containing ND28
alone, thus showing differentiation of MNC into the granulocyte
system, but the frequency of the generation of CD61 expressed
cells was lower than in the case of the addition of the solution
to be tested containing TPO alone, thus showing decreased
differentiation into the megakaryocyte system. These results
suggest that, when the same amounts of TPO and ND28 are present,
MNC react mostly with ND28 and differentiate into the
granulocyte system.

However, when the fusion polypeptide of TPO and ND28,
namely TPO-ND28, was added as the solution to be tested, the
frequency of the generation of CD61 expressed cells was similar
to or higher than in the case of the addition of the solution to
be tested containing TPO alone and two times or more higher
than in the case of the addition of TPO/ND28. What is more, the
frequency of the generation of Gr-1 expressed cells was also
similar to the case of the addition of the solution to be
tested containing ND28 alone.
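
The comparison drawn above can be reproduced from the CD61 values reported in Table 7. The following sketch computes, for each concentration, the ratio of CD61 expressed cells with the fusion polypeptide to that with the simple mixture; the dictionary layout and names are illustrative only.

```python
# CD61 expressed cells (%) from Table 7, keyed by concentration (ng/ml).
cd61 = {
    "TPO":      {1.0: 8.7,  10.0: 17.8, 100.0: 21.9},
    "TPO/ND28": {1.0: 10.3, 10.0: 10.4, 100.0: 5.7},
    "TPO-ND28": {1.0: 22.1, 10.0: 26.6, 100.0: 18.8},
}

# Ratio of the fusion polypeptide to the mixture at each concentration.
ratios = {
    conc: cd61["TPO-ND28"][conc] / cd61["TPO/ND28"][conc]
    for conc in (1.0, 10.0, 100.0)
}

for conc, ratio in ratios.items():
    print(f"{conc:>5} ng/ml: TPO-ND28 / (TPO/ND28) = {ratio:.2f}")

print("two-fold or more at every concentration:",
      all(r >= 2.0 for r in ratios.values()))
```

At every tested concentration the ratio exceeds 2 (about 2.1, 2.6 and 3.3), consistent with the statement that the fusion polypeptide gave two times or more CD61 expressed cells than the mixture.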

Test Example 4 Platelet and leukocyte production-enhancing 
function in mice 

A 10 ng/ml solution of TPO or a 10 µg/ml solution of
TPO-ND28 (3) obtained in Example 3 was administered by
subcutaneous injection to BALB/c mice (males, 7 weeks of age)
with a dose of 0.2 ml per 20 g body weight of each mouse, once
a day continuously for 4 days starting on the first day of the
test (treated groups, 4 animals per group). A blood sample
was collected from the ophthalmic vein of each animal on the
fifth day of the test to count the number of platelets and
leukocytes by a microcell counter (Sysmex F800, manufactured by
Toa Iyo Denshi Co.).
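
For orientation, the daily dose of TPO-ND28 (3) per unit body weight follows from the concentration and injection volume given above. This is a minimal sketch taking the TPO-ND28 (3) solution as 10 µg/ml; the variable names are illustrative.

```python
conc_ug_per_ml = 10.0     # TPO-ND28 (3) solution
volume_ml = 0.2           # injected volume per 20 g body weight
body_weight_kg = 0.020    # 20 g
days = 4                  # once a day for 4 days

dose_ug = conc_ug_per_ml * volume_ml              # µg per injection
dose_ug_per_kg = dose_ug / body_weight_kg         # µg/kg per day
total_ug_per_kg = dose_ug_per_kg * days           # cumulative over the test

print(f"dose per injection: {dose_ug} µg")
print(f"daily dose: {dose_ug_per_kg:.0f} µg/kg")
print(f"cumulative dose over {days} days: {total_ug_per_kg:.0f} µg/kg")
```

Under these figures each animal receives about 100 µg/kg per day, or about 400 µg/kg over the four days.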

After introducing the plasmid pAGE210 used for the
expression of TPO or TPO-ND28 (3) gene into CHO cells in
accordance with the method described in Example 2-2), the cells
were cultivated, the resulting culture supernatant was treated
by the same TPO-ND28 (3) purification procedure described in
Example 3, and an elution fraction corresponding to the elution
fraction of TPO-ND28 (3) was used as a blank solution to count
the number of platelets and leukocytes by the above-mentioned
method.

In order to compare and examine the effects of TPO and
TPO-ND28 (3), the increasing ratio (%) of the number of platelets
and leukocytes in the group to which each of these substances
was administered, relative to that in the blank
solution-administered group, was calculated based on the
following formula:

[platelet or leukocyte count in mice of TPO- or TPO-ND28 (3)-administered group] / [platelet or leukocyte count in mice of blank solution-administered group] x 100
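
The formula above can be expressed as a small function. In this sketch the group value is taken as the mean count of the animals in each group, which is an assumption (the text does not say how the four animals per group were combined), and the example counts are hypothetical placeholders rather than measured data.

```python
from statistics import mean

def increasing_ratio(treated_counts, blank_counts):
    """Increasing ratio (%) of treated group counts relative to the blank group."""
    return mean(treated_counts) / mean(blank_counts) * 100

# Hypothetical cell counts for 4 animals per group (arbitrary units).
treated = [110, 120, 130, 120]
blank = [60, 60, 60, 60]

ratio = increasing_ratio(treated, blank)
print(f"increasing ratio: {ratio:.0f}%")
```

A ratio of 100% means no change relative to the blank solution-administered group; values above 100% indicate an increase.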

The results are shown in Table 8.

TABLE 8

Test substance    Increasing ratio of platelets (%)    Increasing ratio of leukocytes (%)
TPO               219                                  106
TPO-ND28          170                                  160

INDUSTRIAL APPLICABILITY 

A fusion polypeptide comprising both a polypeptide having
G-CSF activity and a polypeptide having TPO activity is
provided by the present invention. The fusion polypeptide of
the present invention can form and amplify platelets and
leukocytes simultaneously and can control formation of
megakaryocyte colonies and neutrophil colonies and
differentiation or maturation of megakaryocyte precursors and
neutrophil precursors.

SEQUENCE LISTING

Sequence ID No.: 1
Sequence Length: 328
Sequence Type: amino acid
Strandedness: single
Topology: linear
Molecular Type: peptide
Original Source
Organism: human (Homo sapiens)
Sequence Characteristics:
Designation:
Location: 1..154
Designation:
Location: 154..328

Sequence

Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu Leu
1               5                   10                  15

Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gln Cys Pro Glu Val
            20                  25                  30

His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asp Phe Ser Leu
        35                  40                  45

Gly Glu Trp Lys Thr Gln Met Glu Glu Thr Lys Ala Gln Asp Ile Leu
    50                  55                  60

Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gln
65                  70                  75                  80

Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly Gln Leu Ser Gly Gln
                85                  90                  95

Val Arg Leu Leu Leu Gly Ala Leu Gln Ser Leu Leu Gly Thr Gln Leu
            100                 105                 110

Pro Pro Gln Gly Arg Thr Thr Ala His Lys Asp Pro Asn Ala Ile Phe
        115                 120                 125

Leu Ser Phe Gln His Leu Leu Arg Gly Lys Val Arg Phe Leu Met Leu
    130                 135                 140

Val Gly Gly Ser Thr Leu Cys Val Arg Arg Ala Pro Thr Tyr Arg Ala
145                 150                 155                 160

Ser Ser Leu Pro Gln Ser Phe Leu Leu Lys Ser Leu Glu Gln Val Arg
                165                 170                 175

Lys Ile Gln Gly Asp Gly Ala Ala Leu Gln Glu Lys Leu Cys Ala Thr
            180                 185                 190

Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu
        195                 200                 205

Gly Ile Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gln Ala Leu Gln
    210                 215                 220

Leu Ala Gly Cys Leu Ser Gln Leu His Ser Gly Leu Phe Leu Tyr Gln
225                 230                 235                 240

Gly Leu Leu Gln Ala Leu Glu Gly Ile Ser Pro Glu Leu Gly Pro Thr
                245                 250                 255

Leu Asp Thr Leu Gln Leu Asp Val Ala Asp Phe Ala Thr Thr Ile Trp
            260                 265                 270

Gln Gln Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gln Pro Thr Gln
        275                 280                 285

Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gln Arg Arg Ala Gly Gly
    290                 295                 300

Val Leu Val Ala Ser His Leu Gln Ser Phe Leu Glu Val Ser Tyr Arg
305                 310                 315                 320

Val Leu Arg His Leu Ala Gln Pro
                325



Sequence ID No.: 2
Sequence Length: 340
Sequence Type: amino acid
Strandedness: single
Topology: linear
Molecular Type: peptide
Original Source
Organism: human (Homo sapiens)
Sequence Characteristics:
Designation:
Location: 1..153
Designation:
Location: 167..340

Sequence

Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu Leu
1               5                   10                  15

Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gln Cys Pro Glu Val
            20                  25                  30

His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asp Phe Ser Leu
        35                  40                  45

Gly Glu Trp Lys Thr Gln Met Glu Glu Thr Lys Ala Gln Asp Ile Leu
    50                  55                  60

Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gln
65                  70                  75                  80

Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly Gln Leu Ser Gly Gln
                85                  90                  95

Val Arg Leu Leu Leu Gly Ala Leu Gln Ser Leu Leu Gly Thr Gln Leu
            100                 105                 110

Pro Pro Gln Gly Arg Thr Thr Ala His Lys Asp Pro Asn Ala Ile Phe
        115                 120                 125

Leu Ser Phe Gln His Leu Leu Arg Gly Lys Val Arg Phe Leu Met Leu
    130                 135                 140

Val Gly Gly Ser Thr Leu Cys Val Arg Gly Gly Gly Ser Gly Gly Gly
145                 150                 155                 160

Ser Gly Gly Gly Ser Arg Ala Pro Thr Tyr Arg Ala Ser Ser Leu Pro
                165                 170                 175

Gln Ser Phe Leu Leu Lys Ser Leu Glu Gln Val Arg Lys Ile Gln Gly
            180                 185                 190

Asp Gly Ala Ala Leu Gln Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys
        195                 200                 205

His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu Gly Ile Pro Trp
    210                 215                 220

Ala Pro Leu Ser Ser Cys Pro Ser Gln Ala Leu Gln Leu Ala Gly Cys
225                 230                 235                 240

Leu Ser Gln Leu His Ser Gly Leu Phe Leu Tyr Gln Gly Leu Leu Gln
                245                 250                 255

Ala Leu Glu Gly Ile Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu
            260                 265                 270

Gln Leu Asp Val Ala Asp Phe Ala Thr Thr Ile Trp Gln Gln Met Glu
        275                 280                 285

Glu Leu Gly Met Ala Pro Ala Leu Gln Pro Thr Gln Gly Ala Met Pro
    290                 295                 300

Ala Phe Ala Ser Ala Phe Gln Arg Arg Ala Gly Gly Val Leu Val Ala
305                 310                 315                 320

Ser His Leu Gln Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His
                325                 330                 335

Leu Ala Gln Pro
            340


Sequence ID No.: 3
Sequence Length: 344
Sequence Type: amino acid
Strandedness: single
Topology: linear
Molecular Type: peptide
Original Source
Organism: human (Homo sapiens)
Sequence Characteristics:
Designation:
Location: 1..153
Designation:
Location: 171..344

Sequence

Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu Leu
1               5                   10                  15

Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gln Cys Pro Glu Val
            20                  25                  30

His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asp Phe Ser Leu
        35                  40                  45

Gly Glu Trp Lys Thr Gln Met Glu Glu Thr Lys Ala Gln Asp Ile Leu
    50                  55                  60

Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gln
65                  70                  75                  80

Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly Gln Leu Ser Gly Gln
                85                  90                  95

Val Arg Leu Leu Leu Gly Ala Leu Gln Ser Leu Leu Gly Thr Gln Leu
            100                 105                 110

Pro Pro Gln Gly Arg Thr Thr Ala His Lys Asp Pro Asn Ala Ile Phe
        115                 120                 125

Leu Ser Phe Gln His Leu Leu Arg Gly Lys Val Arg Phe Leu Met Leu
    130                 135                 140

Val Gly Gly Ser Thr Leu Cys Val Arg Ser Gly Gly Gly Ser Gly Gly
145                 150                 155                 160

Gly Ser Gly Gly Gly Ser Gly Gly Gly Arg Ala Pro Thr Tyr Arg Ala
                165                 170                 175

Ser Ser Leu Pro Gln Ser Phe Leu Leu Lys Ser Leu Glu Gln Val Arg
            180                 185                 190

Lys Ile Gln Gly Asp Gly Ala Ala Leu Gln Glu Lys Leu Cys Ala Thr
        195                 200                 205

Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu
    210                 215                 220

Gly Ile Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gln Ala Leu Gln
225                 230                 235                 240

Leu Ala Gly Cys Leu Ser Gln Leu His Ser Gly Leu Phe Leu Tyr Gln
                245                 250                 255

Gly Leu Leu Gln Ala Leu Glu Gly Ile Ser Pro Glu Leu Gly Pro Thr
            260                 265                 270

Leu Asp Thr Leu Gln Leu Asp Val Ala Asp Phe Ala Thr Thr Ile Trp
        275                 280                 285

Gln Gln Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gln Pro Thr Gln
    290                 295                 300

Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gln Arg Arg Ala Gly Gly
305                 310                 315                 320

Val Leu Val Ala Ser His Leu Gln Ser Phe Leu Glu Val Ser Tyr Arg
                325                 330                 335

Val Leu Arg His Leu Ala Gln Pro
            340


Sequence ID No.: 4
Sequence Length: 1047
Sequence Type: nucleic acid
Strandedness: double
Topology: linear
Molecular Type: other nucleic acid, synthetic DNA
Original Source
Organism: human (Homo sapiens)
Sequence Characteristics:
Designation: sig peptide
Location: 1..63
Designation: CDS
Location: 64..1047

Sequence

ATG GAG CTG ACT GAA TTG CTC CTC GTG GTC ATG CTT CTC CTA ACT GCA  48
Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala
    -20                 -15                 -10

AGG CTA ACG CTG TCC AGC CCG GCT CCT CCT GCT TGT GAC CTC CGA GTC  96
Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val
-5                  1               5                   10

CTC AGT AAA CTG CTT CGT GAC TCC CAT GTC CTT CAC AGC AGA CTG AGC  144
Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser
            15                  20                  25

CAG TGC CCA GAG GTT CAC CCT TTG CCT ACA CCT GTC CTG CTG CCT GCT  192
Gln Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala
        30                  35                  40

GTG GAC TTT AGC TTG GGA GAA TGG AAA ACC CAG ATG GAG GAG ACC AAG  240
Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gln Met Glu Glu Thr Lys
    45                  50                  55

GCA CAG GAC ATT CTG GGA GCA GTG ACC CTT CTG CTG GAG GGA GTG ATG  288
Ala Gln Asp Ile Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met
60                  65                  70                  75

GCA GCA CGG GGA CAA CTG GGA CCC ACT TGC CTC TCA TCC CTC CTG GGG  336
Ala Ala Arg Gly Gln Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly
                80                  85                  90

CAG CTT TCT GGA CAG GTC CGT CTC CTC CTT GGG GCC CTG CAG AGC CTC  384
Gln Leu Ser Gly Gln Val Arg Leu Leu Leu Gly Ala Leu Gln Ser Leu
            95                  100                 105

CTT GGA ACC CAG CTT CCT CCA CAG GGC AGG ACC ACA GCT CAC AAG GAT  432
Leu Gly Thr Gln Leu Pro Pro Gln Gly Arg Thr Thr Ala His Lys Asp
        110                 115                 120

CCC AAT GCC ATC TTC CTG AGC TTC CAA CAC CTG CTC CGA GGA AAG GTG  480
Pro Asn Ala Ile Phe Leu Ser Phe Gln His Leu Leu Arg Gly Lys Val
    125                 130                 135

CGT TTC CTG ATG CTT GTA GGA GGG TCC ACC CTC TGC GTA CGG CGG GCG  528
Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Arg Ala
140                 145                 150                 155

CCA ACA TAT CGC GCC TCG AGT CTA CCA CAG AGC TTC CTT TTA AAA AGC  576
Pro Thr Tyr Arg Ala Ser Ser Leu Pro Gln Ser Phe Leu Leu Lys Ser
                160                 165                 170

TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG GAG  624
Leu Glu Gln Val Arg Lys Ile Gln Gly Asp Gly Ala Ala Leu Gln Glu
            175                 180                 185

AAG CTG TGT GCC ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG CTG  672
Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu
        190                 195                 200

CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC CCC  720
Leu Gly His Ser Leu Gly Ile Pro Trp Ala Pro Leu Ser Ser Cys Pro
    205                 210                 215

AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC GGC  768
Ser Gln Ala Leu Gln Leu Ala Gly Cys Leu Ser Gln Leu His Ser Gly
220                 225                 230                 235

CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC CCC  816
Leu Phe Leu Tyr Gln Gly Leu Leu Gln Ala Leu Glu Gly Ile Ser Pro
                240                 245                 250

GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC TTT  864
Glu Leu Gly Pro Thr Leu Asp Thr Leu Gln Leu Asp Val Ala Asp Phe
            255                 260                 265

GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT GCC  912
Ala Thr Thr Ile Trp Gln Gln Met Glu Glu Leu Gly Met Ala Pro Ala
        270                 275                 280

CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC CAG  960
Leu Gln Pro Thr Gln Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gln
    285                 290                 295

CGC CGG GCA GGA GGG GTC CTA GTT GCC TCC CAT CTG CAG AGC TTC CTG  1008
Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gln Ser Phe Leu
300                 305                 310                 315

GAG GTG TCG TAC CGC GTT CTA CGC CAC CTT GCC CAG CCC  1047
Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gln Pro
                320                 325
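
The internal consistency of this listing can be spot-checked with a short script. The sketch below translates the first sixteen codons with a partial codon table (only the codons needed for that row, taken from the standard genetic code) and derives the number of codons in each annotated region from the stated locations; all names are illustrative.

```python
# Partial standard codon table covering only the codons in the first row.
CODONS = {
    "ATG": "Met", "GAG": "Glu", "GAA": "Glu", "ACT": "Thr", "GCA": "Ala",
    "CTG": "Leu", "TTG": "Leu", "CTC": "Leu", "CTT": "Leu", "CTA": "Leu",
    "GTG": "Val", "GTC": "Val",
}

first_row = "ATG GAG CTG ACT GAA TTG CTC CTC GTG GTC ATG CTT CTC CTA ACT GCA"
listed = "Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala"

translated = " ".join(CODONS[codon] for codon in first_row.split())
print("translation matches listing:", translated == listed)

def codon_count(start: int, end: int) -> int:
    """Number of codons in a 1-based, inclusive nucleotide range."""
    length = end - start + 1
    assert length % 3 == 0, "range is not a whole number of codons"
    return length // 3

signal_codons = codon_count(1, 63)      # sig peptide, 1..63
mature_codons = codon_count(64, 1047)   # CDS, 64..1047

print("signal peptide codons:", signal_codons)
print("mature polypeptide codons:", mature_codons)
```

The region 64..1047 corresponds to 328 codons, which agrees with the sequence length of 328 given for Sequence ID No.: 1, and the region 1..63 corresponds to the 21 residues numbered -21 to -1.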

Sequence ID No.: 5
Sequence Length: 1083
Sequence Type: nucleic acid
Strandedness: double
Topology: linear
Molecular Type: other nucleic acid, synthetic DNA
Original Source
Organism: human (Homo sapiens)
Sequence Characteristics:
Designation: sig peptide
Location: 1..63
Designation: CDS
Location: 64..1083

Sequence

ATG GAG CTG ACT GAA TTG CTC CTC GTG GTC ATG CTT CTC CTA ACT GCA  48
Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala
    -20                 -15                 -10

AGG CTA ACG CTG TCC AGC CCG GCT CCT CCT GCT TGT GAC CTC CGA GTC  96
Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val
-5                  1               5                   10

CTC AGT AAA CTG CTT CGT GAC TCC CAT GTC CTT CAC AGC AGA CTG AGC  144
Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser
            15                  20                  25

CAG TGC CCA GAG GTT CAC CCT TTG CCT ACA CCT GTC CTG CTG CCT GCT  192
Gln Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala
        30                  35                  40

GTG GAC TTT AGC TTG GGA GAA TGG AAA ACC CAG ATG GAG GAG ACC AAG  240
Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gln Met Glu Glu Thr Lys
    45                  50                  55

GCA CAG GAC ATT CTG GGA GCA GTG ACC CTT CTG CTG GAG GGA GTG ATG  288
Ala Gln Asp Ile Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met
60                  65                  70                  75

GCA GCA CGG GGA CAA CTG GGA CCC ACT TGC CTC TCA TCC CTC CTG GGG  336
Ala Ala Arg Gly Gln Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly
                80                  85                  90

CAG CTT TCT GGA CAG GTC CGT CTC CTC CTT GGG GCC CTG CAG AGC CTC  384
Gln Leu Ser Gly Gln Val Arg Leu Leu Leu Gly Ala Leu Gln Ser Leu
            95                  100                 105

CTT GGA ACC CAG CTT CCT CCA CAG GGC AGG ACC ACA GCT CAC AAG GAT  432
Leu Gly Thr Gln Leu Pro Pro Gln Gly Arg Thr Thr Ala His Lys Asp
        110                 115                 120

CCC AAT GCC ATC TTC CTG AGC TTC CAA CAC CTG CTC CGA GGA AAG GTG  480
Pro Asn Ala Ile Phe Leu Ser Phe Gln His Leu Leu Arg Gly Lys Val
    125                 130                 135

CGT TTC CTG ATG CTT GTA GGA GGG TCC ACC CTC TGC GTC AGG GGT GGC  528
Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Gly Gly
140                 145                 150                 155

GGT TCT GGA GGT GGT TCC GGA GGG GGT TCT AGA GCA CCA ACA TAT CGC  576
Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Arg Ala Pro Thr Tyr Arg
                160                 165                 170

GCC TCG AGT CTA CCA CAG AGC TTC CTT TTA AAA AGC TTA GAG CAA GTG  624
Ala Ser Ser Leu Pro Gln Ser Phe Leu Leu Lys Ser Leu Glu Gln Val
            175                 180                 185

AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG GAG AAG CTG TGT GCC 672 
Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala 

190 195 200 

ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG CTG CTC GGA CAC TCT 720 
Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser 

205 210 215 

CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC CCC AGC CAG GCC CTG 768 
Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu 
220 225 230 235 

CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC GGC CTT TTC CTC TAC 816 
Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr 

240 245 250 
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CAG GGG CTC CTG GAG GCC CTG GAA GGG ATC TCC CCC GAG TTG GGT CCC 8G4 
Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser Pro Glu Leu Gly Pro 

255 260 265 

ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC TTT GCC ACC ACC ATC 912 
Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr lie 

270 275 280 

TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT GCC CTG CAG CCC ACC 960 
Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro Thr 

285 290 295 

CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC CAG CGC CGG GCA GGA 1008 
Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala Gly 
300 305 310 315 

GGG GTC CTA GTT GCC TCC CAT CTG CAG AGC TTC CTG GAG GTG TCG TAC 1056 
Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val Ser Tyr 

320 325 330 

CGC GTT CTA CGC CAC CTT GCC CAG CCC 1083 
Arg Val Leu Arg His Leu Ala Gin Pro 

335 340 
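
The feature coordinates stated for Sequence ID No. 5 can be checked against its residue numbering with simple arithmetic. The following is a minimal sketch in Python, not part of the filed listing; the helper name `codon_count` is hypothetical, and the coordinates are treated as 1-based and inclusive, which is an assumption consistent with the stated sequence length of 1083.

```python
# Feature coordinates as stated in the listing for Sequence ID No. 5
# (assumed to be 1-based and inclusive).
SEQUENCE_LENGTH = 1083
SIG_PEPTIDE = (1, 63)
CDS = (64, 1083)


def codon_count(start: int, end: int) -> int:
    """Return the number of whole codons in a 1-based inclusive range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"range {start}..{end} is not a whole number of codons")
    return length // 3


# Residues of the signal peptide (numbered -21 to -1 in the listing).
signal_residues = codon_count(*SIG_PEPTIDE)
# Residues of the mature fusion polypeptide (numbered 1 to 340).
mature_residues = codon_count(*CDS)
# The two features together should account for every nucleotide.
covers_whole_sequence = (signal_residues + mature_residues) * 3 == SEQUENCE_LENGTH
```

Under these assumptions the signal peptide spans 21 codons and the mature region 340 codons, which agrees with the final residue number printed under the last row of the sequence.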

Sequence ID No.: 6
Sequence Length: 1095
Sequence Type: nucleic acid
Strandedness: double
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA 
Original Source

Organism: human (Homo sapiens)
Sequence Characteristics:

Designation: sig peptide
Location: 1..63
Designation: CDS
Location: 64..1095

Sequence 

ATG GAG CTG ACT GAA TTG CTC CTC GTG GTC ATG CTT CTC CTA ACT GCA 48
Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala
-20 -15 -10

AGG CTA ACG CTG TCC AGC CCG GCT CCT CCT GCT TGT GAC CTC CGA GTC 96
Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val
-5 1 5 10

CTC AGT AAA CTG CTT CGT GAC TCC CAT GTC CTT CAC AGC AGA CTG AGC 144
Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser
15 20 25

CAG TGC CCA GAG GTT CAC CCT TTG CCT ACA CCT GTC CTG CTG CCT GCT 192
Gln Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala
30 35 40

GTG GAC TTT AGC TTG GGA GAA TGG AAA ACC CAG ATG GAG GAG ACC AAG 240
Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gln Met Glu Glu Thr Lys
45 50 55

GCA CAG GAC ATT CTG GGA GCA GTG ACC CTT CTG CTG GAG GGA GTG ATG 288
Ala Gln Asp Ile Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met
60 65 70 75

GCA GCA CGG GGA CAA CTG GGA CCC ACT TGC CTC TCA TCC CTC CTG GGG 336
Ala Ala Arg Gly Gln Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly
80 85 90

CAG CTT TCT GGA CAG GTC CGT CTC CTC CTT GGG GCC CTG CAG AGC CTC 384
Gln Leu Ser Gly Gln Val Arg Leu Leu Leu Gly Ala Leu Gln Ser Leu
95 100 105

CTT GGA ACC CAG CTT CCT CCA CAG GGC AGG ACC ACA GCT CAC AAG GAT 432
Leu Gly Thr Gln Leu Pro Pro Gln Gly Arg Thr Thr Ala His Lys Asp
110 115 120

CCC AAT GCC ATC TTC CTG AGC TTC CAA CAC CTG CTC CGA GGA AAG GTG 480
Pro Asn Ala Ile Phe Leu Ser Phe Gln His Leu Leu Arg Gly Lys Val
125 130 135

CGT TTC CTG ATG CTT GTA GGA GGG TCC ACC CTC TGC GTA CGG TCC GGA 528
Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Ser Gly
140 145 150 155

GGT GGC TCT GGC GGT GGT TCT GGT GGC GGC TCC GGA GGC GGT CGT GCG 576
Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Arg Ala
160 165 170

CCA ACA TAT CGC GCC TCG AGT CTA CCA CAG AGC TTC CTT TTA AAA AGC 624
Pro Thr Tyr Arg Ala Ser Ser Leu Pro Gln Ser Phe Leu Leu Lys Ser
175 180 185

TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG GAG 672
Leu Glu Gln Val Arg Lys Ile Gln Gly Asp Gly Ala Ala Leu Gln Glu
190 195 200

AAG CTG TGT GCC ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG CTG 720
Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu
205 210 215

CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC CCC 768
Leu Gly His Ser Leu Gly Ile Pro Trp Ala Pro Leu Ser Ser Cys Pro
220 225 230 235

AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC GGC 816
Ser Gln Ala Leu Gln Leu Ala Gly Cys Leu Ser Gln Leu His Ser Gly
240 245 250

CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC CCC 864
Leu Phe Leu Tyr Gln Gly Leu Leu Gln Ala Leu Glu Gly Ile Ser Pro
255 260 265

GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC TTT 912
Glu Leu Gly Pro Thr Leu Asp Thr Leu Gln Leu Asp Val Ala Asp Phe
270 275 280

GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT GCC 960
Ala Thr Thr Ile Trp Gln Gln Met Glu Glu Leu Gly Met Ala Pro Ala
285 290 295

CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC CAG 1008
Leu Gln Pro Thr Gln Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gln
300 305 310 315

CGC CGG GCA GGA GGG GTC CTA GTT GCC TCC CAT CTG CAG AGC TTC CTG 1056
Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gln Ser Phe Leu
320 325 330

GAG GTG TCG TAC CGC GTT CTA CGC CAC CTT GCC CAG CCC 1095
Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gln Pro
335 340



Sequence ID No.: 7
Sequence Length: 44
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence Characteristics:

Designation: sig peptide

Location: 27..44

Sequence

CTCTCCAAGC TTGAATTCCG GCCAGAATGG AGCTGACTGA ATTG
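
The "sig peptide" feature at positions 27..44 of Sequence ID No. 7 can be checked by translating those 18 nucleotides and comparing the result with the first residues of the signal peptide shown in the longer sequences (Met Glu Leu Thr Glu Leu). The sketch below is illustrative only and assumes the standard genetic code; the function name `translate` and the variable names are hypothetical and do not come from the application.

```python
from itertools import product

# Standard genetic code with bases ordered T, C, A, G
# (the ordering used by NCBI translation table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string into one-letter amino acid codes."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Sequence ID No. 7 as printed in the listing (spaces are layout only).
SEQ_ID_7 = "CTCTCCAAGC TTGAATTCCG GCCAGAATGG AGCTGACTGA ATTG".replace(" ", "")

# Positions 27..44 in the listing are 1-based and inclusive.
coding_part = SEQ_ID_7[27 - 1:44]
peptide = translate(coding_part)
```

Here `peptide` holds the one-letter codes for the six residues encoded by positions 27..44, which can be compared by eye with the three-letter residues printed under the first row of Sequence ID Nos. 5 and 6.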

Sequence ID No.: 8
Sequence Length: 47
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence Characteristics:

Designation: CDS

Location: 23..47

Sequence

GTAGAGGTAC CGCGGCCGCT TACCCTTCCT GAGACAGATT CTGGGAG 47



Sequence ID No.: 9
Sequence Length: 24
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence Characteristics:

Designation: CDS

Location: 1..24

Sequence

TGAACCTCTG GGCACTGGCT CAGT

Sequence ID No.: 10
Sequence Length: 24
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence Characteristics:

Designation: CDS

Location: 1..24

Sequence

GCTGCCTGCT GTGGACTTTA GCTT 24

Sequence ID No.: 11
Sequence Length: 24
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence Characteristics:

Designation: CDS

Location: 1..24

Sequence

TGTTGGAAGC TCAGGAAGAT GGCA

Sequence ID No.: 12
Sequence Length: 24
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence Characteristics:

Designation: CDS

Location: 1..24

Sequence

CCTGATGCTT GTAGGAGGGT CCAC

Sequence ID No.: 13
Sequence Length: 24
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence Characteristics:

Designation: CDS

Location: 1..24

Sequence

TCAAGAGTTC GTGTATCCTG TTCA

Sequence ID No.: 14
Sequence Length: 24
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence Characteristics:

Designation: CDS

Location: 1..24

Sequence

GAATGGAACT CGTGGACTCT TTCC

Sequence ID No.: 15
Sequence Length: 17
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence

GTAAAACGAC GGCCAGT

Sequence ID No.: 16
Sequence Length: 17
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence

CAGGAAACAG CTATGAC

Sequence ID No.: 17
Sequence Length: 13
Sequence Type: amino acid
Strandedness: single
Topology: linear

Molecular Type: peptide
Sequence

Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Arg
1 5 10

Sequence ID No.: 18
Sequence Length: 66
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence Characteristics:

Designation: CDS

Location: 1..3

Designation: CDS

Location: 43..66

Sequence

TGCTCTAGAA CCGCCTCCGG AACCACCTCC AGAACCGCCA CCCCTGACGC AGAGGGTGGA 60
CCCTCC 66

Sequence ID No.: 19
Sequence Length: 45
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence Characteristics:
Designation: CDS
Location: 22..45

Sequence

GGTTCCGGAG GCGGTTCTAG AGCACCAACA TATCGCGCCT CGAGT

Sequence ID No.: 20
Sequence Length: 48
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence Characteristics:

Designation: CDS

Location: 28..48

Sequence

CATTCCGCGG GGTACCGCGG CCGCTCAGGG CTGGGCAAGG TGGCGTAG

Sequence ID No.: 21
Sequence Length: 24
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence Characteristics:
Designation: CDS

Location: 1..24

Sequence

GGCTGCTTGA GCCAACTCCA TAGC 24

Sequence ID No.: 22
Sequence Length: 24
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence Characteristics:

Designation: CDS

Location: 1..24

Sequence

GACCCAACTC GGGGGAGATC CCTT 24

Sequence ID No.: 23
Sequence Length: 57
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence Characteristics:

Designation: CDS
Location: 1..27
Designation: CDS
Location: 28..57
Designation: mutation
Location: 25
Designation: mutation
Location: 33..34

Sequence

TAGACTCGAG GCGCGATATG TTGGCGCCCG CCGTACGCAG AGGGTGGACC CTCCTAC 57

Sequence ID No.: 24
Sequence Length: 17
Sequence Type: amino acid
Strandedness: single
Topology: linear
Molecular Type: peptide
Sequence

Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Arg
1 5 10 15

Sequence ID No.: 25
Sequence Length: 61
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence Characteristics:

Designation: CDS of TPO
Location: 1..6
Designation: linker peptide
Location: 7..57
Designation: CDS of ND28
Location: 58..61
Designation: SplI
Location: 1..5
Designation: MroI
Location: 7..12
Designation: MroI
Location: 43..48
Designation: BbeI
Location: 58..61
Designation: mutation
Location: 4..5

Sequence

GTACGGTCCG GAGGTGGCTC TGGCGGTGGT TCTGGTGGCG GCTCCGGAGG CGGTCGTGCG C 61

Sequence ID No.: 26
Sequence Length: 53
Sequence Type: nucleic acid
Strandedness: single
Topology: linear

Molecular Type: other nucleic acid, synthetic DNA
Sequence Characteristics:

Designation: CDS of TPO
Location: 52..53
Designation: linker peptide
Location: 1..51
Designation: SplI
Location: 53
Designation: MroI
Location: 10..15
Designation: MroI
Location: 46..51

Sequence

ACGACCGCCT CCGGAGCCGC CACCAGAACC ACCGCCAGAG CCACCTCCGG ACC 53
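
Sequence ID Nos. 25 and 26 read as the two strands of a synthetic linker, and that reading can be checked by comparing the reverse complement of one strand with the other. The sketch below is illustrative only: the function and variable names are hypothetical, and interpreting the four unpaired bases at each end of Sequence ID No. 25 as restriction-site overhangs is an assumption suggested by the SplI and BbeI annotations, not a statement from the listing.

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


# Sequences as printed in the listing (spaces are layout only).
SEQ_ID_25 = (
    "GTACGGTCCG GAGGTGGCTC TGGCGGTGGT TCTGGTGGCG "
    "GCTCCGGAGG CGGTCGTGCG C"
).replace(" ", "")
SEQ_ID_26 = (
    "ACGACCGCCT CCGGAGCCGC CACCAGAACC ACCGCCAGAG "
    "CCACCTCCGG ACC"
).replace(" ", "")

# Positions 5..57 of Sequence ID No. 25 (1-based, inclusive).
duplex_region = SEQ_ID_25[4:57]
# Bases of Sequence ID No. 25 left unpaired at either end.
left_overhang = SEQ_ID_25[:4]
right_overhang = SEQ_ID_25[57:]

# True when Sequence ID No. 26 pairs exactly with positions 5..57.
strands_pair = reverse_complement(SEQ_ID_26) == duplex_region
```

If `strands_pair` is true, the 53-nucleotide strand anneals to positions 5..57 of the 61-nucleotide strand, leaving four unpaired bases at each end of Sequence ID No. 25.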




CLAIMS 

1. A fusion polypeptide which comprises a polypeptide 
having a human granulocyte colony stimulating factor activity 
and a polypeptide having a platelet growth factor activity. 

2. A fusion polypeptide in which a polypeptide having
a human granulocyte colony stimulating factor activity and a
polypeptide having a platelet growth factor activity are fused
via a spacer peptide.

3. The fusion polypeptide according to claim 1 wherein
the fusion polypeptide is a polypeptide which contains an amino
acid sequence selected from the amino acid sequences shown in
Sequence ID Nos. 1, 2, 3, 4, 5 and 6.

4. A fusion polypeptide in which one or more amino
acids are added, deleted or substituted in the amino acid
sequence of the fusion polypeptide disclosed in claim 1, 2 or
3, and which has a human granulocyte colony stimulating factor
activity and a platelet growth factor activity.

5. A fusion polypeptide having a human granulocyte
colony stimulating factor activity and a platelet growth factor
activity in which at least one amino group of the fusion
polypeptide disclosed in claim 1, 2, 3 or 4 is chemically
modified with a polyalkylene glycol derivative.

6. The fusion polypeptide according to claim 5 wherein
the polyalkylene glycol derivative is a polyethylene glycol
derivative, a polypropylene glycol derivative or a
polyoxyethylene-polyoxypropylene copolymer derivative.

7. A DNA which codes for the fusion polypeptide
disclosed in claim 1, 2, 3 or 4.

8. The DNA according to claim 6 wherein the DNA is a
DNA which contains a sequence selected from the DNA sequences
shown in Sequence ID Nos. 4, 5 and 6.

9. An anemia-treating composition containing the
fusion polypeptide disclosed in claim 1, 2, 3, 4 or 5 as an
active ingredient.
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ABSTRACT 

The present invention relates to a fusion polypeptide 
which comprises a polypeptide having G-CSF activity and a 
polypeptide having TPO activity and DNA which codes for the 
fusion polypeptide, to a fusion polypeptide in which a 
polypeptide having G-CSF activity and a polypeptide having TPO 
activity are fused via a spacer peptide and DNA which codes for 
the fusion polypeptide and to a polypeptide in which the fusion 
polypeptide comprising a polypeptide having G-CSF activity and 
a polypeptide having TPO activity is chemically modified with 
a polyalkylene glycol derivative. It also relates to an 
anemia-treating composition containing the fusion polypeptide 
as an active ingredient. 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re Patent Application of 

YOKOI et al Atty. Ref.: 249-118

Serial No. to be assigned Group: 1646 

Filed: October 6, 2000 Examiner: Mertz 

For: HG-CSF FUSION POLYPEPTIDE HAVING C-MPL
ACTIVITY, DNA CODING FOR SAME AND METHODS
OF TREATING ANEMIA USING SAME (AS AMENDED)

Assistant Commissioner for Patents 
Washington, DC 20231 

Sir: 

DECLARATION OF DEPOSITED MATERIALS 

As an authorized representative and on the instructions of Kyowa Hakko Kogyo 
Co., Ltd., a Japanese corporation of 6-1, Ohtemachi 1-chome, Chiyoda-ku, Tokyo, Japan, 
the assignee of the U.S. Patent application identified above, I hereby declare that: 

• Kyowa Hakko Kogyo Co., Ltd. is the depositor and owner of the following 
deposits of biological materials identified and referred to in the specification of 
this application and on the attached deposit receipts, international form, under the 
terms of the Budapest Treaty. 

Accession No. FERM- Depositor's Reference Date Deposited 

FERM BP-5001 Escherichia coli TLN-1 February 16, 1995 

• The deposits of biological material identified above were made at the National
Institute of Bioscience and Human Technology, Agency of Industrial Science and
Technology, Ibaraki, Japan, and have been deposited and accepted under the terms
of the Budapest Treaty on the International Recognition of the Deposit of
Microorganisms for the Purposes of Patent Procedure and therefore the filing of a
viability statement is unnecessary [37 C.F.R. 1.807(b)].

• The deposits will be maintained for a period of 30 years from the date of deposit 
or for the enforceable life of any patent issuing from this application or for a 
period of 5 years after the date of the most recent request for the furnishing of a 
sample of the deposited material, whichever is longest. 

• The deposits will be replaced should they become contaminated or no longer
viable.

• Subject to 37 C.F.R. § 1.808(b), all restrictions imposed by the depositor on the
availability to the public of the deposited material will be irrevocably removed
upon the granting of the patent.

• Access to the deposited material is permitted during the pendency of the above-identified
patent application to one determined by the Commissioner of Patents
and Trademarks to be so entitled under 37 C.F.R. § 1.14 and 35 U.S.C. § 122.

I further declare that all statements made herein of my own knowledge are true
and that all statements made on information and belief are believed to be true; and further
that these statements were made with the knowledge that willful false statements and the
like so made are punishable by fine or imprisonment, or both, under Section 1001 of Title
18 of the United States Code and that such willful false statements may jeopardize the
validity of the application or any patent issuing thereon.


By: 




Arthur R. Crawford 
Reg. No. 25,327 
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RULE 63 (37 C.F.R. 1.63) NiTOn * v****^ rc (6/92) 

DECLARATION AND POWER OF ATTORNEY 
FOR PATENT APPLICATION 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

As a below named inventor, I hereby declare that my residence, post office address and citizenship are as stated below next to my name; and I
believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and joint inventor (if plural names are
listed below) of the subject matter which is claimed and for which a patent is sought on the invention entitled

NOVEL POLYPEPTIDES 

the specification of which (check applicable box(es)): 

[ X ] is attached hereto. 

[ ] was filed on as U.S. Application Serial No 



[ X ] was filed as PCT international application No. PCT/JP96/01157 on April 26, 1996
and (if applicable to U.S. or PCT application) was amended on 



I hereby state that I have reviewed and understand the contents of the above identified specification, including the claims, as amended by any
amendment referred to above. I acknowledge the duty to disclose information which is material to the examination of this application in
accordance with 37 C.F.R. 1.56(a). I hereby claim foreign priority benefits under 35 U.S.C. 119/365 of any foreign application(s) for patent or
inventor's certificate listed below and have also identified below any foreign application for patent or inventor's certificate having a filing date
before that of the application on which priority is claimed or, if no priority is claimed, before the filing date of this application:
Prior Foreign Application (s): 

Application Number Country Day/Month/Year Filed 

Hei. 7-102625 Japan 26/April/1995 



I hereby claim the benefit under 35 U.S.C. 120/365 of all prior United States and PCT international applications listed above or below and, insofar
as the subject matter of each of the claims of this application is not disclosed in such prior applications in the manner provided by the first
paragraph of 35 U.S.C. 112, I acknowledge the duty to disclose material information as defined in 37 C.F.R. 1.56(a) which occurred between the
filing date of the prior applications and the national or PCT international filing date of this application:

Prior U.S./PCT Application(s): Status: patented,

Application Serial No. Day/Month/Year Filed pending, abandoned



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief are
believed to be true; and further that these statements were made with the knowledge that willful false statements and the like so made are
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such willful false statements may
jeopardize the validity of the application or any patent issued thereon.

And I hereby appoint NIXON & VANDERHYE P.C., 1100 North Glebe Road, 8th Floor, Arlington, Virginia 22201-4714, telephone
number (703) 816-4000 (to whom all communications are to be directed), and the following attorneys thereof (of the same address) individually
and collectively my attorneys to prosecute this application and to transact all business in the Patent and Trademark Office connected therewith and
with the resulting patent: Arthur R. Crawford, 25327; Larry S. Nixon, 25640; Robert A. Vanderhye, 27076; James T. Hosmer, 30184; Robert W.
Faris, 31352; Richard G. Besha, 22770; Mark E. Nusbaum, 32348; Michael J. Keenan, 32106; Bryan H. Davidson, 30251; Stanley C. Spooner,
27393; Leonard C. Mitchard, 29009; Duane M. Byers, 33363; Paul J. Henon, 33626; Jeffry H. Nelson, 30481; John R. Lastova, 33149; H. Warren
Burnam, Jr., 29366; Thomas E. Byrne, 32205.

1) Inventor's Signature Date December 16, 1996

Inventor's Name (typed) Haruhiko Yokoi Japan

First Middle Initial Family Name Citizenship

Residence (City) Ibaraki (State/Foreign Country) Japan

Post Office Address 1-23-5, Sengen, Tsukuba-shi, Ibaraki, Japan Zip Code 305

2) Inventor's Signature Date December 16, 1996

Inventor's Name (typed) Yukimasa Shiotsu Japan

First Middle Initial Family Name Citizenship

Residence (City) Tokyo (State/Foreign Country) Japan

Post Office Address 1-17-20, Kinuta, Setagaya-ku, Tokyo, Japan Zip Code 15X

3) Inventor's Signature Date December 16, 1996

Inventor's Name (typed) Noboru Konishi Japan

First Middle Initial Family Name Citizenship

Residence (City) Yamaguchi (State/Foreign Country) Japan

Post Office Address 535-12, Shinden, Hohu-shi, Yamaguchi, Japan Zip Code 747



FOR ADDITIONAL INVENTORS, check box [X ] and attach sheet with same information and signature and date for each. 



4) Inventor's Signature Date December 16, 1996

Inventor's Name (typed) Hideharu Anazawa Japan

First Middle Initial Family Name Citizenship

Residence (City) Tokyo (State/Foreign Country) Japan

Post Office Address 4-19-18, Minami-ohizumi, Nerima-ku, Tokyo, Japan Zip Code 178

5) Inventor's Signature Date December 16, 1996

Inventor's Name (typed) Tatsuya Tamaoki Japan

First Middle Initial Family Name Citizenship

Residence (City) Tokyo (State/Foreign Country) Japan

Post Office Address 2662-13, Honmachida, Machida-shi, Tokyo, Japan Zip Code 194

6) Inventor's Signature Date December 16, 1996

Inventor's Name (typed) Motoo Yamasaki Japan

First Middle Initial Family Name Citizenship

Residence (City) Tokyo (State/Foreign Country) Japan

Post Office Address 3-9-13, Naka-machi, Machida-shi, Tokyo, Japan Zip Code 121

7) Inventor's Signature Date December 16, 1996

Inventor's Name (typed) Yoko Japan

First Middle Initial Family Name Citizenship

Residence (City) Tokyo (State/Foreign Country) Japan

Post Office Address 1392-5, Honmachida, Machida-shi, Tokyo, Japan Zip Code 194

8) Inventor's Signature Date December 16, 1996

Inventor's Name (typed) Kazuhisa Uchida Japan

First Middle Initial Family Name Citizenship

Residence (City) Tokyo (State/Foreign Country) Japan

Post Office Address 2-12-3, Naruse, Machida-shi, Tokyo, Japan Zip Code 194

9) Inventor's Signature Date December 16, 1996

Inventor's Name (typed) K ^ n Y a Yamashita Japan

First Middle Initial Family Name Citizenship

Residence (City) Shizuoka (State/Foreign Country) Japan

Post Office Address 25-2, Sasahara-shinden, Mishima-shi, Shizuoka, Japan Zip Code 411






IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re Patent Application of 

YOKOI et al Atty. Ref.: 249-118

Serial No. to be assigned Group: 1646 

Filed: October 6, 2000 Examiner: Mertz 

For: HG-CSF FUSION POLYPEPTIDE HAVING C-MPL 

ACTIVITY, DNA CODING FOR SAME AND METHODS 
OF TREATING ANEMIA USING SAME(AS AMENDED) 



October 6, 2000 



Assistant Commissioner for Patents 
Washington, DC 20231 



Sir: 

LETTER RE SEQUENCE LISTING 

Pursuant to Rule 821(e) the applicants note the computer readable copy of the 
Sequence Listing of the present application is identical to the computer readable form of 
the Sequence Listing in the parent application Serial No. 08/765,337. The applicants 
request that the computer readable copy of the Sequence Listing from application Serial 
No. 08/765,337 be used in the present application. 

Respectfully submitted, 
NIXON & VANDERHYE P.C. 



By: 




Arthur R. Crawford
Reg. No. 25,327 



ARC:pfc 

1100 North Glebe Road, 8th Floor
Arlington, VA 22201-4714 
Telephone: (703) 816-4000 
Facsimile: (703) 816-4100 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Atty. Docket No: 249-89 

In re patent application of 
YOKOI, Haruhiko et al 
Serial No. 08/765,337 
Filed: December 23, 1996 
For: NOVEL POLYPEPTIDES 

STATEMENT TO SUPPORT FILING AND SUBMISSION IN 
ACCORDANCE WITH 37 C.F.R. §§ 1.821-1.825



Assistant Commissioner for Patents
Washington, D.C. 20231 
Box SEQUENCE 

Sir: 

In connection with a Sequence Listing submitted concurrently 
herewith, the undersigned hereby states that: 

1. the submission, filed herewith in accordance with 37
C.F.R. § 1.821(g), does not include new matter;

2. the content of the attached paper copy and the
attached computer readable copy of the Sequence Listing, submitted in
accordance with 37 C.F.R. § 1.821(c) and (e), respectively, are the same;
and

3. all statements made herein of their own knowledge are
true and that all statements made on information and belief are believed to
be true; and further, that these statements were made with the knowledge
that willful false statements and the like so made are punishable by fine
or imprisonment, or both, under Section 1001 of Title 18 of the United
States Code and that such willful false statements may jeopardize the
validity of the application or any patent resulting therefrom.



Respectfully submitted,

Intellectual Property Services
1500A Lafayette Road
Suite 262
Portsmouth, N.H.
800-318-3021
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APPENDIX 



• Original SEQ ID NOS 1-3 remain the same;

• Original SEQ ID NOS 4-6 are now SEQ ID NOS 4-5, 6-7 & 8-9, respectively;

• Original SEQ ID NO: 7 is now SEQ ID NO: 10;

• Original SEQ ID NO: 8 is now SEQ ID NO: 11;

• Original SEQ ID NO: 9 is now SEQ ID NO: 12;

• Original SEQ ID NO: 10 is now SEQ ID NO: 13;

*Please note that part or all of the CDS regions for original SEQ ID NOS 8-10 were either incorrect
or contained internal stop codons; since the amino acids were not explicitly listed in the application, these
coding regions were left out of the corrected Sequence Listing.

• Original SEQ ID NO: 11 is now SEQ ID NOS 14-15;

• Original SEQ ID NO: 12 is now SEQ ID NOS 16-17;

• Original SEQ ID NO: 13 is now SEQ ID NOS 18-19;

• Original SEQ ID NO: 14 is now SEQ ID NOS 20-21;

• Original SEQ ID NOS 15-17 are now SEQ ID NOS 22-24;

• Original SEQ ID NOS 18-19 are now SEQ ID NOS 25-26 & 27-28, respectively;

• Original SEQ ID NO: 20 is now SEQ ID NO: 29;
*See above note re: SEQ ID NOS 8-10.

• Original SEQ ID NOS 21-22 are now SEQ ID NOS 30-31 & 32-33, respectively;

• Original SEQ ID NO: 23 is now SEQ ID NO: 34;
*See above note re: SEQ ID NOS 8-10.

• Original SEQ ID NO: 24 is now SEQ ID NO: 35;

• Original SEQ ID NO: 25 is now SEQ ID NOS 36-37;
*See above note re: SEQ ID NOS 8-10.

• Original SEQ ID NO: 26 is now SEQ ID NO: 38;
*See above note re: SEQ ID NOS 8-10.

• Table 1 encompasses SEQ ID NO: 39;

• Table 3 encompasses SEQ ID NO: 40;

• Table 4 encompasses SEQ ID NOS 24, 35 & 41-45, respectively.
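
The renumbering in this appendix can be captured as a lookup table, which makes it easy to confirm that the new identifiers run from 1 to 45 without gaps, matching the number of sequences stated in the corrected listing. The sketch below is illustrative only; the names `ORIGINAL_TO_NEW`, `TABLE_ONLY` and `new_ids` are hypothetical and do not appear in the filing.

```python
# Original SEQ ID NO -> new SEQ ID NO(s), transcribed from the appendix.
ORIGINAL_TO_NEW = {
    1: (1,), 2: (2,), 3: (3,),
    4: (4, 5), 5: (6, 7), 6: (8, 9),
    7: (10,), 8: (11,), 9: (12,), 10: (13,),
    11: (14, 15), 12: (16, 17), 13: (18, 19), 14: (20, 21),
    15: (22,), 16: (23,), 17: (24,),
    18: (25, 26), 19: (27, 28), 20: (29,),
    21: (30, 31), 22: (32, 33), 23: (34,), 24: (35,),
    25: (36, 37), 26: (38,),
}

# New identifiers that the appendix assigns only to sequences in
# Tables 1, 3 and 4 (Table 4 also reuses SEQ ID NOS 24 and 35).
TABLE_ONLY = {39, 40, 41, 42, 43, 44, 45}


def new_ids(original_id: int) -> tuple:
    """Return the new SEQ ID NO(s) for an original SEQ ID NO."""
    return ORIGINAL_TO_NEW[original_id]


# Every new identifier named in the appendix, in ascending order.
all_new = sorted({n for ids in ORIGINAL_TO_NEW.values() for n in ids} | TABLE_ONLY)
```

For example, `new_ids(25)` returns the pair of identifiers that replace original SEQ ID NO: 25, and `all_new` can be compared with the sequence count given under GENERAL INFORMATION.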



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Yokoi, Haruhiko 

Shiotsu, Yukimasa 
Konishi, Noboru 

(ii) TITLE OF INVENTION: NOVEL POLYPEPTIDES 

(iii) NUMBER OF SEQUENCES: 45

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NIXON & VANDERHYE P.C. 

(B) STREET: 1100 North Glebe Rd. 8th floor 

(C) CITY: Arlington

(D) STATE: VA

(E) COUNTRY: USA

(F) ZIP: 22201-4714

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/765,337 

(B) FILING DATE: 23-DEC-1996 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/JP96/01157

(B) FILING DATE: 26-APR-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP P.HEI.7-102625

(B) FILING DATE: 26-APR-1995

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Crawford, Arthur R. 

(B) REGISTRATION NUMBER: 25,327 

(C) REFERENCE/DOCKET NUMBER: 249-89 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 703-816-4000 

(B) TELEFAX: 703-816-4100 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu Leu 
1 5 10 15 

Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gln Cys Pro Glu Val 
20 25 30 

His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asp Phe Ser Leu 
35 40 45 

Gly Glu Trp Lys Thr Gln Met Glu Glu Thr Lys Ala Gln Asp Ile Leu 
50 55 60 

Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gln 
65 70 75 80 

Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly Gln Leu Ser Gly Gln 
85 90 95 

Val Arg Leu Leu Leu Gly Ala Leu Gln Ser Leu Leu Gly Thr Gln Leu 
100 105 110 

Pro Pro Gln Gly Arg Thr Thr Ala His Lys Asp Pro Asn Ala Ile Phe 
115 120 125 

Leu Ser Phe Gln His Leu Leu Arg Gly Lys Val Arg Phe Leu Met Leu 
130 135 140 

Val Gly Gly Ser Thr Leu Cys Val Arg Arg Ala Pro Thr Tyr Arg Ala 
145 150 155 160 

Ser Ser Leu Pro Gln Ser Phe Leu Leu Lys Ser Leu Glu Gln Val Arg 
165 170 175 

Lys Ile Gln Gly Asp Gly Ala Ala Leu Gln Glu Lys Leu Cys Ala Thr 
180 185 190 

Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu 
195 200 205 

Gly Ile Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gln Ala Leu Gln 
210 215 220 

Leu Ala Gly Cys Leu Ser Gln Leu His Ser Gly Leu Phe Leu Tyr Gln 
225 230 235 240 

Gly Leu Leu Gln Ala Leu Glu Gly Ile Ser Pro Glu Leu Gly Pro Thr 
245 250 255 

Leu Asp Thr Leu Gln Leu Asp Val Ala Asp Phe Ala Thr Thr Ile Trp 
260 265 270 

Gln Gln Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gln Pro Thr Gln 
275 280 285 

Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gln Arg Arg Ala Gly Gly 
290 295 300 

Val Leu Val Ala Ser His Leu Gln Ser Phe Leu Glu Val Ser Tyr Arg 
305 310 315 320 

Val Leu Arg His Leu Ala Gln Pro 
325 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu Leu 
1 5 10 15 

Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gln Cys Pro Glu Val 
20 25 30 

His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asp Phe Ser Leu 
35 40 45 

Gly Glu Trp Lys Thr Gln Met Glu Glu Thr Lys Ala Gln Asp Ile Leu 
50 55 60 

Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gln 
65 70 75 80 

Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly Gln Leu Ser Gly Gln 
85 90 95 

Val Arg Leu Leu Leu Gly Ala Leu Gln Ser Leu Leu Gly Thr Gln Leu 
100 105 110 

Pro Pro Gln Gly Arg Thr Thr Ala His Lys Asp Pro Asn Ala Ile Phe 
115 120 125 

Leu Ser Phe Gln His Leu Leu Arg Gly Lys Val Arg Phe Leu Met Leu 
130 135 140 

Val Gly Gly Ser Thr Leu Cys Val Arg Gly Gly Gly Ser Gly Gly Gly 
145 150 155 160 

Ser Gly Gly Gly Ser Arg Ala Pro Thr Tyr Arg Ala Ser Ser Leu Pro 
165 170 175 

Gln Ser Phe Leu Leu Lys Ser Leu Glu Gln Val Arg Lys Ile Gln Gly 
180 185 190 

Asp Gly Ala Ala Leu Gln Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys 
195 200 205 

His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu Gly Ile Pro Trp 
210 215 220 

Ala Pro Leu Ser Ser Cys Pro Ser Gln Ala Leu Gln Leu Ala Gly Cys 
225 230 235 240 

Leu Ser Gln Leu His Ser Gly Leu Phe Leu Tyr Gln Gly Leu Leu Gln 
245 250 255 

Ala Leu Glu Gly Ile Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu 
260 265 270 

Gln Leu Asp Val Ala Asp Phe Ala Thr Thr Ile Trp Gln Gln Met Glu 
275 280 285 

Glu Leu Gly Met Ala Pro Ala Leu Gln Pro Thr Gln Gly Ala Met Pro 
290 295 300 

Ala Phe Ala Ser Ala Phe Gln Arg Arg Ala Gly Gly Val Leu Val Ala 
305 310 315 320 

Ser His Leu Gln Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His 
325 330 335 

Leu Ala Gln Pro 
340 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 amino acids 
(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu Leu 
1 5 10 15 

Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gln Cys Pro Glu Val 
20 25 30 

His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asp Phe Ser Leu 
35 40 45 

Gly Glu Trp Lys Thr Gln Met Glu Glu Thr Lys Ala Gln Asp Ile Leu 
50 55 60 

Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gln 
65 70 75 80 

Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly Gln Leu Ser Gly Gln 
85 90 95 

Val Arg Leu Leu Leu Gly Ala Leu Gln Ser Leu Leu Gly Thr Gln Leu 
100 105 110 

Pro Pro Gln Gly Arg Thr Thr Ala His Lys Asp Pro Asn Ala Ile Phe 
115 120 125 

Leu Ser Phe Gln His Leu Leu Arg Gly Lys Val Arg Phe Leu Met Leu 
130 135 140 

Val Gly Gly Ser Thr Leu Cys Val Arg Ser Gly Gly Gly Ser Gly Gly 
145 150 155 160 

Gly Ser Gly Gly Gly Ser Gly Gly Gly Arg Ala Pro Thr Tyr Arg Ala 
165 170 175 

Ser Ser Leu Pro Gln Ser Phe Leu Leu Lys Ser Leu Glu Gln Val Arg 
180 185 190 

Lys Ile Gln Gly Asp Gly Ala Ala Leu Gln Glu Lys Leu Cys Ala Thr 
195 200 205 

Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu 
210 215 220 

Gly Ile Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gln Ala Leu Gln 
225 230 235 240 

Leu Ala Gly Cys Leu Ser Gln Leu His Ser Gly Leu Phe Leu Tyr Gln 
245 250 255 

Gly Leu Leu Gln Ala Leu Glu Gly Ile Ser Pro Glu Leu Gly Pro Thr 
260 265 270 

Leu Asp Thr Leu Gln Leu Asp Val Ala Asp Phe Ala Thr Thr Ile Trp 
275 280 285 

Gln Gln Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gln Pro Thr Gln 
290 295 300 

Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gln Arg Arg Ala Gly Gly 
305 310 315 320 

Val Leu Val Ala Ser His Leu Gln Ser Phe Leu Glu Val Ser Tyr Arg 
325 330 335 

Val Leu Arg His Leu Ala Gln Pro 
340 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 1..63 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 64..1047 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1047 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATG GAG CTG ACT GAA TTG CTC CTC GTG GTC ATG CTT CTC CTA ACT GCA 48 
Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 
-21 -20 -15 -10 

AGG CTA ACG CTG TCC AGC CCG GCT CCT CCT GCT TGT GAC CTC CGA GTC 96 
Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val 
-5 1 5 10 

CTC AGT AAA CTG CTT CGT GAC TCC CAT GTC CTT CAC AGC AGA CTG AGC 144 
Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser 
15 20 25 

CAG TGC CCA GAG GTT CAC CCT TTG CCT ACA CCT GTC CTG CTG CCT GCT 192 
Gln Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala 
30 35 40 

GTG GAC TTT AGC TTG GGA GAA TGG AAA ACC CAG ATG GAG GAG ACC AAG 240 
Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gln Met Glu Glu Thr Lys 
45 50 55 

GCA CAG GAC ATT CTG GGA GCA GTG ACC CTT CTG CTG GAG GGA GTG ATG 288 
Ala Gln Asp Ile Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 
60 65 70 75 

GCA GCA CGG GGA CAA CTG GGA CCC ACT TGC CTC TCA TCC CTC CTG GGG 336 
Ala Ala Arg Gly Gln Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly 
80 85 90 

CAG CTT TCT GGA CAG GTC CGT CTC CTC CTT GGG GCC CTG CAG AGC CTC 384 
Gln Leu Ser Gly Gln Val Arg Leu Leu Leu Gly Ala Leu Gln Ser Leu 
95 100 105 

CTT GGA ACC CAG CTT CCT CCA CAG GGC AGG ACC ACA GCT CAC AAG GAT 432 
Leu Gly Thr Gln Leu Pro Pro Gln Gly Arg Thr Thr Ala His Lys Asp 
110 115 120 

CCC AAT GCC ATC TTC CTG AGC TTC CAA CAC CTG CTC CGA GGA AAG GTG 480 
Pro Asn Ala Ile Phe Leu Ser Phe Gln His Leu Leu Arg Gly Lys Val 
125 130 135 

CGT TTC CTG ATG CTT GTA GGA GGG TCC ACC CTC TGC GTA CGG CGG GCG 528 
Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Arg Ala 
140 145 150 155 

CCA ACA TAT CGC GCC TCG AGT CTA CCA CAG AGC TTC CTT TTA AAA AGC 576 
Pro Thr Tyr Arg Ala Ser Ser Leu Pro Gln Ser Phe Leu Leu Lys Ser 
160 165 170 

TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG GAG 
Leu Glu Gln Val Arg Lys Ile Gln Gly Asp Gly Ala Ala Leu Gln Glu 
175 180 185 

AAG CTG TGT GCC ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG CTG 
Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu 
190 195 200 

CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC CCC 
Leu Gly His Ser Leu Gly Ile Pro Trp Ala Pro Leu Ser Ser Cys Pro 
205 210 215 

AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC GGC 
Ser Gln Ala Leu Gln Leu Ala Gly Cys Leu Ser Gln Leu His Ser Gly 
220 225 230 235 

CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC CCC 
Leu Phe Leu Tyr Gln Gly Leu Leu Gln Ala Leu Glu Gly Ile Ser Pro 
240 245 250 

GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC TTT 
Glu Leu Gly Pro Thr Leu Asp Thr Leu Gln Leu Asp Val Ala Asp Phe 
255 260 265 

GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT GCC 
Ala Thr Thr Ile Trp Gln Gln Met Glu Glu Leu Gly Met Ala Pro Ala 
270 275 280 

CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC CAG 
Leu Gln Pro Thr Gln Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gln 
285 290 295 

CGC CGG GCA GGA GGG GTC CTA GTT GCC TCC CAT CTG CAG AGC TTC CTG 
Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gln Ser Phe Leu 
300 305 310 315 

GAG GTG TCG TAC CGC GTT CTA CGC CAC CTT GCC CAG CCC 
Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gln Pro 
320 325 
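
The feature table for SEQ ID NO: 4 can be cross-checked against the stated sequence lengths: a 1-based inclusive nucleotide range a..b spans (b - a + 1)/3 codons. The following Python sketch is illustrative only; the function name `codons_in` is an assumption and not part of the listing.

```python
def codons_in(start, end):
    """Number of codons in the 1-based inclusive nucleotide range start..end."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    return length // 3


sig = codons_in(1, 63)      # sig_peptide 1..63
mat = codons_in(64, 1047)   # mat_peptide 64..1047
cds = codons_in(1, 1047)    # CDS 1..1047
print(sig, mat, cds)        # prints: 21 328 349
```

These counts agree with the numbering of the translation, which runs from -21 through 328, and with the 328 amino acids stated for the mature peptide of SEQ ID NO: 1.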



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 
-21 -20 -15 -10 

Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val 
-5 1 5 10 

Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser 
15 20 25 

Gln Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala 
30 35 40 

Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gln Met Glu Glu Thr Lys 
45 50 55 

Ala Gln Asp Ile Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 
60 65 70 75 

Ala Ala Arg Gly Gln Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly 
80 85 90 

Gln Leu Ser Gly Gln Val Arg Leu Leu Leu Gly Ala Leu Gln Ser Leu 
95 100 105 

Leu Gly Thr Gln Leu Pro Pro Gln Gly Arg Thr Thr Ala His Lys Asp 
110 115 120 

Pro Asn Ala Ile Phe Leu Ser Phe Gln His Leu Leu Arg Gly Lys Val 
125 130 135 

Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Arg Ala 
140 145 150 155 

Pro Thr Tyr Arg Ala Ser Ser Leu Pro Gln Ser Phe Leu Leu Lys Ser 
160 165 170 

Leu Glu Gln Val Arg Lys Ile Gln Gly Asp Gly Ala Ala Leu Gln Glu 
175 180 185 

Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu 
190 195 200 

Leu Gly His Ser Leu Gly Ile Pro Trp Ala Pro Leu Ser Ser Cys Pro 
205 210 215 

Ser Gln Ala Leu Gln Leu Ala Gly Cys Leu Ser Gln Leu His Ser Gly 
220 225 230 235 

Leu Phe Leu Tyr Gln Gly Leu Leu Gln Ala Leu Glu Gly Ile Ser Pro 
240 245 250 

Glu Leu Gly Pro Thr Leu Asp Thr Leu Gln Leu Asp Val Ala Asp Phe 
255 260 265 

Ala Thr Thr Ile Trp Gln Gln Met Glu Glu Leu Gly Met Ala Pro Ala 
270 275 280 

Leu Gln Pro Thr Gln Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gln 
285 290 295 

Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gln Ser Phe Leu 
300 305 310 315 

Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gln Pro 
320 325 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1083 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 1..63 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 64..1083 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1083 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ATG GAG CTG ACT GAA TTG CTC CTC GTG GTC ATG CTT CTC CTA ACT GCA 
Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 
-21 -20 -15 -10 

AGG CTA ACG CTG TCC AGC CCG GCT CCT CCT GCT TGT GAC CTC CGA GTC 
Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val 
-5 1 5 10 

CTC AGT AAA CTG CTT CGT GAC TCC CAT GTC CTT CAC AGC AGA CTG AGC 
Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser 
15 20 25 

CAG TGC CCA GAG GTT CAC CCT TTG CCT ACA CCT GTC CTG CTG CCT GCT 
Gln Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala 
30 35 40 

GTG GAC TTT AGC TTG GGA GAA TGG AAA ACC CAG ATG GAG GAG ACC AAG 
Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gln Met Glu Glu Thr Lys 
45 50 55 

GCA CAG GAC ATT CTG GGA GCA GTG ACC CTT CTG CTG GAG GGA GTG ATG 
Ala Gln Asp Ile Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 
60 65 70 75 

GCA GCA CGG GGA CAA CTG GGA CCC ACT TGC CTC TCA TCC CTC CTG GGG 
Ala Ala Arg Gly Gln Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly 
80 85 90 

CAG CTT TCT GGA CAG GTC CGT CTC CTC CTT GGG GCC CTG CAG AGC CTC 
Gln Leu Ser Gly Gln Val Arg Leu Leu Leu Gly Ala Leu Gln Ser Leu 
95 100 105 

CTT GGA ACC CAG CTT CCT CCA CAG GGC AGG ACC ACA GCT CAC AAG GAT 
Leu Gly Thr Gln Leu Pro Pro Gln Gly Arg Thr Thr Ala His Lys Asp 
110 115 120 

CCC AAT GCC ATC TTC CTG AGC TTC CAA CAC CTG CTC CGA GGA AAG GTG 
Pro Asn Ala Ile Phe Leu Ser Phe Gln His Leu Leu Arg Gly Lys Val 
125 130 135 

CGT TTC CTG ATG CTT GTA GGA GGG TCC ACC CTC TGC GTC AGG GGT GGC 
Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Gly Gly 
140 145 150 155 

GGT TCT GGA GGT GGT TCC GGA GGG GGT TCT AGA GCA CCA ACA TAT CGC 
Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Arg Ala Pro Thr Tyr Arg 
160 165 170 

GCC TCG AGT CTA CCA CAG AGC TTC CTT TTA AAA AGC TTA GAG CAA GTG 
Ala Ser Ser Leu Pro Gln Ser Phe Leu Leu Lys Ser Leu Glu Gln Val 
175 180 185 

AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG GAG AAG CTG TGT GCC 
Arg Lys Ile Gln Gly Asp Gly Ala Ala Leu Gln Glu Lys Leu Cys Ala 
190 195 200 

ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG CTG CTC GGA CAC TCT 
Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser 
205 210 215 

CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC CCC AGC CAG GCC CTG 
Leu Gly Ile Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gln Ala Leu 
220 225 230 235 

CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC GGC CTT TTC CTC TAC 
Gln Leu Ala Gly Cys Leu Ser Gln Leu His Ser Gly Leu Phe Leu Tyr 
240 245 250 

CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC CCC GAG TTG GGT CCC 
Gln Gly Leu Leu Gln Ala Leu Glu Gly Ile Ser Pro Glu Leu Gly Pro 
255 260 265 

ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC TTT GCC ACC ACC ATC 
Thr Leu Asp Thr Leu Gln Leu Asp Val Ala Asp Phe Ala Thr Thr Ile 
270 275 280 

TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT GCC CTG CAG CCC ACC 
Trp Gln Gln Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gln Pro Thr 
285 290 295 

CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC CAG CGC CGG GCA GGA 
Gln Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gln Arg Arg Ala Gly 
300 305 310 315 

GGG GTC CTA GTT GCC TCC CAT CTG CAG AGC TTC CTG GAG GTG TCG TAC 
Gly Val Leu Val Ala Ser His Leu Gln Ser Phe Leu Glu Val Ser Tyr 
320 325 330 

CGC GTT CTA CGC CAC CTT GCC CAG CCC 
Arg Val Leu Arg His Leu Ala Gln Pro 
335 340 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 
-21 -20 -15 -10 

Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val 
-5 1 5 10 

Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser 
15 20 25 

Gln Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala 
30 35 40 

Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gln Met Glu Glu Thr Lys 
45 50 55 

Ala Gln Asp Ile Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 
60 65 70 75 

Ala Ala Arg Gly Gln Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly 
80 85 90 

Gln Leu Ser Gly Gln Val Arg Leu Leu Leu Gly Ala Leu Gln Ser Leu 
95 100 105 

Leu Gly Thr Gln Leu Pro Pro Gln Gly Arg Thr Thr Ala His Lys Asp 
110 115 120 

Pro Asn Ala Ile Phe Leu Ser Phe Gln His Leu Leu Arg Gly Lys Val 
125 130 135 

Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Gly Gly 
140 145 150 155 

Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Arg Ala Pro Thr Tyr Arg 
160 165 170 

Ala Ser Ser Leu Pro Gln Ser Phe Leu Leu Lys Ser Leu Glu Gln Val 
175 180 185 

Arg Lys Ile Gln Gly Asp Gly Ala Ala Leu Gln Glu Lys Leu Cys Ala 
190 195 200 

Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser 
205 210 215 

Leu Gly Ile Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gln Ala Leu 
220 225 230 235 

Gln Leu Ala Gly Cys Leu Ser Gln Leu His Ser Gly Leu Phe Leu Tyr 
240 245 250 

Gln Gly Leu Leu Gln Ala Leu Glu Gly Ile Ser Pro Glu Leu Gly Pro 
255 260 265 

Thr Leu Asp Thr Leu Gln Leu Asp Val Ala Asp Phe Ala Thr Thr Ile 
270 275 280 

Trp Gln Gln Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gln Pro Thr 
285 290 295 

Gln Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gln Arg Arg Ala Gly 
300 305 310 315 

Gly Val Leu Val Ala Ser His Leu Gln Ser Phe Leu Glu Val Ser Tyr 
320 325 330 

Arg Val Leu Arg His Leu Ala Gln Pro 
335 340 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 
(B) LOCATION: 64..1095 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1095 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ATG GAG CTG ACT GAA TTG CTC CTC GTG GTC ATG CTT CTC CTA ACT GCA 48 
Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 
-21 -20 -15 -10 

AGG CTA ACG CTG TCC AGC CCG GCT CCT CCT GCT TGT GAC CTC CGA GTC 96 
Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val 
-5 1 5 10 

CTC AGT AAA CTG CTT CGT GAC TCC CAT GTC CTT CAC AGC AGA CTG AGC 144 
Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser 
15 20 25 

CAG TGC CCA GAG GTT CAC CCT TTG CCT ACA CCT GTC CTG CTG CCT GCT 192 
Gln Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala 
30 35 40 

GTG GAC TTT AGC TTG GGA GAA TGG AAA ACC CAG ATG GAG GAG ACC AAG 240 
Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gln Met Glu Glu Thr Lys 
45 50 55 

GCA CAG GAC ATT CTG GGA GCA GTG ACC CTT CTG CTG GAG GGA GTG ATG 288 
Ala Gln Asp Ile Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 
60 65 70 75 

GCA GCA CGG GGA CAA CTG GGA CCC ACT TGC CTC TCA TCC CTC CTG GGG 336 
Ala Ala Arg Gly Gln Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly 
80 85 90 

CAG CTT TCT GGA CAG GTC CGT CTC CTC CTT GGG GCC CTG CAG AGC CTC 384 
Gln Leu Ser Gly Gln Val Arg Leu Leu Leu Gly Ala Leu Gln Ser Leu 
95 100 105 

CTT GGA ACC CAG CTT CCT CCA CAG GGC AGG ACC ACA GCT CAC AAG GAT 432 
Leu Gly Thr Gln Leu Pro Pro Gln Gly Arg Thr Thr Ala His Lys Asp 
110 115 120 

CCC AAT GCC ATC TTC CTG AGC TTC CAA CAC CTG CTC CGA GGA AAG GTG 480 
Pro Asn Ala Ile Phe Leu Ser Phe Gln His Leu Leu Arg Gly Lys Val 
125 130 135 

CGT TTC CTG ATG CTT GTA GGA GGG TCC ACC CTC TGC GTA CGG TCC GGA 528 
Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Ser Gly 
140 145 150 155 

GGT GGC TCT GGC GGT GGT TCT GGT GGC GGC TCC GGA GGC GGT CGT GCG 576 
Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Arg Ala 
160 165 170 

CCA ACA TAT CGC GCC TCG AGT CTA CCA CAG AGC TTC CTT TTA AAA AGC 624 
Pro Thr Tyr Arg Ala Ser Ser Leu Pro Gln Ser Phe Leu Leu Lys Ser 
175 180 185 

TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG GAG 672 
Leu Glu Gln Val Arg Lys Ile Gln Gly Asp Gly Ala Ala Leu Gln Glu 
190 195 200 

AAG CTG TGT GCC ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG CTG 720 
Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu 
205 210 215 

CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC CCC 768 
Leu Gly His Ser Leu Gly Ile Pro Trp Ala Pro Leu Ser Ser Cys Pro 
220 225 230 235 

AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC GGC 816 
Ser Gln Ala Leu Gln Leu Ala Gly Cys Leu Ser Gln Leu His Ser Gly 
240 245 250 

CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC CCC 864 
Leu Phe Leu Tyr Gln Gly Leu Leu Gln Ala Leu Glu Gly Ile Ser Pro 
255 260 265 

GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC TTT 912 
Glu Leu Gly Pro Thr Leu Asp Thr Leu Gln Leu Asp Val Ala Asp Phe 
270 275 280 

GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT GCC 960 
Ala Thr Thr Ile Trp Gln Gln Met Glu Glu Leu Gly Met Ala Pro Ala 
285 290 295 

CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC CAG 1008 
Leu Gln Pro Thr Gln Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gln 
300 305 310 315 

CGC CGG GCA GGA GGG GTC CTA GTT GCC TCC CAT CTG CAG AGC TTC CTG 
Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gln Ser Phe Leu 
320 325 330 

GAG GTG TCG TAC CGC GTT CTA CGC CAC CTT GCC CAG CCC 
Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gln Pro 
335 340 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 
-21 -20 -15 -10 

Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val 
-5 1 5 10 

Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser 
15 20 25 

Gln Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala 
30 35 40 

Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gln Met Glu Glu Thr Lys 
45 50 55 

Ala Gln Asp Ile Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 
60 65 70 75 

Ala Ala Arg Gly Gln Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly 
80 85 90 

Gln Leu Ser Gly Gln Val Arg Leu Leu Leu Gly Ala Leu Gln Ser Leu 
95 100 105 

Leu Gly Thr Gln Leu Pro Pro Gln Gly Arg Thr Thr Ala His Lys Asp 
110 115 120 

Pro Asn Ala Ile Phe Leu Ser Phe Gln His Leu Leu Arg Gly Lys Val 
125 130 135 

Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Ser Gly 
140 145 150 155 

Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Arg Ala 
160 165 170 

Pro Thr Tyr Arg Ala Ser Ser Leu Pro Gln Ser Phe Leu Leu Lys Ser 
175 180 185 

Leu Glu Gln Val Arg Lys Ile Gln Gly Asp Gly Ala Ala Leu Gln Glu 
190 195 200 

Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu 
205 210 215 

Leu Gly His Ser Leu Gly Ile Pro Trp Ala Pro Leu Ser Ser Cys Pro 
220 225 230 235 

Ser Gln Ala Leu Gln Leu Ala Gly Cys Leu Ser Gln Leu His Ser Gly 
240 245 250 

Leu Phe Leu Tyr Gln Gly Leu Leu Gln Ala Leu Glu Gly Ile Ser Pro 
255 260 265 

Glu Leu Gly Pro Thr Leu Asp Thr Leu Gln Leu Asp Val Ala Asp Phe 
270 275 280 

Ala Thr Thr Ile Trp Gln Gln Met Glu Glu Leu Gly Met Ala Pro Ala 
285 290 295 

Leu Gln Pro Thr Gln Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gln 
300 305 310 315 

Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gln Ser Phe Leu 
320 325 330 

Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gln Pro 
335 340 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(B) LOCATION: 27..44 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CTCTCCAAGC TTGAATTCCG GCCAGAATGG AGCTGACTGA ATTG 44 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 47 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GTAGAGGTAC CGCGGCCGCT TACCCTTCCT GAGACAGATT CTGGGAG 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TGAACCTCTG GGCACTGGCT CAGT 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GCTGCCTGCT GTGGACTTTA GCTT 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TGT TGG AAG CTC AGG AAG ATG GCA 
Cys Trp Lys Leu Arg Lys Met Ala 
1 5 
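
The amino acid line printed under the CDS of SEQ ID NO: 14 can be reproduced from the nucleotides with the standard genetic code. The following Python sketch is illustrative and not part of the filing; the names `CODON_TABLE`, `THREE_LETTER`, and `translate` are assumptions, and "Ter" is used here only as a placeholder label for a stop codon.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code laid out in TCAG order ('*' marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue codes, matching the listing's notation.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(dna):
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    return " ".join(THREE_LETTER[CODON_TABLE[c]] for c in codons)


print(translate("TGT TGG AAG CTC AGG AAG ATG GCA"))
# prints: Cys Trp Lys Leu Arg Lys Met Ala
```

A check of this kind is also a quick way to spot a transcription error in a coding region, since a mistyped base inside a codon usually changes the residue or produces an unexpected "Ter".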



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Cys Trp Lys Leu Arg Lys Met Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CCT GAT GCT TGT AGG AGG GTC CAC 
Pro Asp Ala Cys Arg Arg Val His 
1 5 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Pro Asp Ala Cys Arg Arg Val His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
(B) LOCATION: 1..24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

TCA AGA GTT CGT GTA TCC TGT TCA 
Ser Arg Val Arg Val Ser Cys Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Ser Arg Val Arg Val Ser Cys Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
(B) LOCATION: 1..24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

GAA TGG AAC TCG TGG ACT CTT TCC 
Glu Trp Asn Ser Trp Thr Leu Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Glu Trp Asn Ser Trp Thr Leu Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GTAAAACGAC GGCCAGT 17 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CAGGAAACAG CTATGAC 17 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Arg 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 66 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join(1..3, 43..66) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

TGC TCTAGAACCG CCTCCGGAAC CACCTCCAGA ACCGCCACC CCT GAC GCA GAG 54 
Cys Pro Asp Ala Glu 
1 5 

GGT GGA CCC TCC 66 
Gly Gly Pro Ser 


(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Cys Pro Asp Ala Glu Gly Gly Pro Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 45 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 22..45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

GGTTCCGGAG GCGGTTCTAG A GCA CCA ACA TAT CGC GCC TCG AGT 

Ala Pro Thr Tyr Arg Ala Ser Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Ala Pro Thr Tyr Arg Ala Ser Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
CATTCCGCGG GGTACCGCGG CCGCTCAGGG CTGGGCAAGG TGGCGTAG 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
(B) LOCATION: 1..24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

GGC TGC TTG AGC CAA CTC CAT AGC 
Gly Cys Leu Ser Gln Leu His Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Gly Cys Leu Ser Gln Leu His Ser
1 5 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

GAC CCA ACT CGG GGG AGA TCC CTT 
Asp Pro Thr Arg Gly Arg Ser Leu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Asp Pro Thr Arg Gly Arg Ser Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 57 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: mutation

(B) LOCATION: replace(25, "")

(ix) FEATURE:

(A) NAME/KEY: mutation

(B) LOCATION: replace(33..34, "")



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
TAGACTCGAG GCGCGATATG TTGGCGCCCG CCGTACGCAG AGGGTGGACC CTCCTAC 57
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly
1 5 10 15 

Arg 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA"



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..6

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 7..37
(D) OTHER INFORMATION: /product= "linker peptide"



(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..5

(D) OTHER INFORMATION: /product= "SplI"

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 7..12

(D) OTHER INFORMATION: /product= "MroI"

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 43..48

(D) OTHER INFORMATION: /product= "MroI"

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 58..61

(D) OTHER INFORMATION: /product= "BbeI"

(ix) FEATURE:

(A) NAME/KEY: mutation

(B) LOCATION: replace(4..5, "")



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

GTA CGG TCCGGAGGTG GCTCTGGCGG TGGTTCTGGT GGCGGCTCCG GAGGCGG 
Val Arg 

1 

TGCGC 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Val Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA"

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..51

(D) OTHER INFORMATION: /product= "linker peptide"

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 53

(D) OTHER INFORMATION: /product= "SplI"

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 10..15

(D) OTHER INFORMATION: /product= "MroI"

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 46..51

(D) OTHER INFORMATION: /product= "MroI"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
ACGACCGCCT CCGGAGCCGC CACCAGAACC ACCGCCAGAG CCACCTCCGG ACC 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

Xaa Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gln Ser Phe Leu Leu
1 5 10 15

Lys Cys Leu Glu Gln Val Arg Lys Ile Gln Gly Asp Gly Ala Ala Leu
20 25 30

Gln Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Gly Leu
35 40 45

Val Leu Leu Gly His Ser Leu Gly Ile Pro Trp Ala Pro Leu Ser Ser
50 55 60

Cys Pro Ser Gln Ala Leu Gln Leu Ala Gly Cys Leu Ser Gln Leu His
65 70 75 80

Ser Gly Leu Phe Leu Tyr Gln Gly Leu Leu Gln Ala Leu Glu Gly Ile
85 90 95

Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gln Leu Asp Val Ala
100 105 110

Asp Phe Ala Thr Thr Ile Trp Gln Gln Met Glu Glu Leu Gly Met Ala
115 120 125

Pro Ala Leu Gln Pro Thr Gln Gly Ala Met Pro Ala Phe Ala Ser Ala
130 135 140

Phe Gln Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gln Ser
145 150 155 160

Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gln Pro
165 170 175

(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu Leu
1 5 10 15

Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gln Cys Pro Glu Val
20 25 30

His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asp Phe Ser Leu
35 40 45

Gly Glu Trp Lys Thr Gln Met Glu Glu Thr Lys Ala Gln Asp Ile Leu
50 55 60

Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gln
65 70 75 80

Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly Gln Leu Ser Gly Gln
85 90 95

Val Arg Leu Leu Leu Gly Ala Leu Gln Ser Leu Leu Gly Thr Gln Leu
100 105 110

Pro Pro Gln Gly Arg Thr Thr Ala His Lys Asp Pro Asn Ala Ile Phe
115 120 125

Leu Ser Phe Gln His Leu Leu Arg Gly Lys Val Arg Phe Leu Met Leu
130 135 140

Val Gly Gly Ser Thr Leu Cys Val Arg Arg Ala Pro Pro Thr Thr Ala
145 150 155 160

Val Pro Ser Arg Thr Ser Leu Val Leu Thr Leu Asn Glu Leu Pro Asn
165 170 175

Arg Thr Ser Gly Leu Leu Glu Thr Asn Phe Thr Ala Ser Ala Arg Thr
180 185 190

Thr Gly Ser Gly Leu Leu Lys Trp Gln Gln Gly Phe Arg Ala Lys Ile
195 200 205

Pro Gly Leu Leu Asn Gln Thr Ser Arg Ser Leu Asp Gln Ile Pro Gly
210 215 220

Tyr Leu Asn Arg Ile His Glu Leu Leu Asn Gly Thr Arg Gly Leu Phe
225 230 235 240

Pro Gly Pro Ser Arg Arg Thr Leu Gly Ala Pro Asp Ile Ser Ser Gly
245 250 255

Thr Ser Asp Thr Gly Ser Leu Pro Pro Asn Leu Gln Pro Gly Tyr Ser
260 265 270

Pro Ser Pro Thr His Pro Pro Thr Gly Gln Tyr Thr Leu Phe Pro Leu
275 280 285

Pro Pro Thr Leu Pro Thr Pro Val Val Gln Leu His Pro Leu Leu Pro
290 295 300

Asp Pro Ser Ala Pro Thr Pro Thr Pro Thr Ser Pro Leu Leu Asn Thr
305 310 315 320

Ser Tyr Thr His Ser Gln Asn Leu Ser Gln Glu Gly
325 330 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Ser Gly Gly Gly Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:

Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly 
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Ser Gly Gly Gly 
1 

(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser 
1 5 10

(2) INFORMATION FOR SEQ ID NO: 45:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Gly Gly Gly Ser Gly Gly Gly Ser 
1 5 
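
The short peptides in SEQ ID NOs: 41 to 45 can be cross-checked against their declared LENGTH fields by converting the three-letter residues to one-letter codes. The following is a minimal sketch; the dictionary names and the helper `to_one_letter` are illustrative, and only the three residue types that occur in these entries are mapped.

```python
# Three-letter to one-letter codes for the residues used in SEQ ID NOs: 41-45.
THREE_TO_ONE = {"Ser": "S", "Gly": "G", "Arg": "R"}

# Peptides as printed in the listing, keyed by SEQ ID NO.
PEPTIDES = {
    41: "Ser Gly Gly Gly Arg",
    42: "Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly",
    43: "Ser Gly Gly Gly",
    44: "Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser",
    45: "Gly Gly Gly Ser Gly Gly Gly Ser",
}

# LENGTH fields (in amino acids) as declared in the listing.
DECLARED_LENGTH = {41: 5, 42: 16, 43: 4, 44: 12, 45: 8}


def to_one_letter(peptide):
    """Convert a space-separated three-letter peptide to one-letter code."""
    return "".join(THREE_TO_ONE[residue] for residue in peptide.split())


for seq_id, peptide in PEPTIDES.items():
    one_letter = to_one_letter(peptide)
    matches = len(one_letter) == DECLARED_LENGTH[seq_id]
    print(seq_id, one_letter, len(one_letter), "OK" if matches else "MISMATCH")
```

Written this way, SEQ ID NO: 42 reads as four repeats of SGGG, while SEQ ID NOs: 44 and 45 read as three and two repeats of GGGS.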